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DATA ON THE GROWTH OF CHILDREN DURING 
THE FIRST YEAR AFTER BIRTH 


BY HELEN THOMPSON 
The Clinic of Child Development, Yale University 


S PART of the study on infancy begun in 1927 at the Child 

Development Clinic of Yale University, data on physical growth 

were obtained. Measurements of nine dimensions were made every four 

weeks on healthy infants from 6 weeks to 52 weeks old. The series is 

a mixed longitudinal one; the majority of children appear more than 

once in the records, but only a few continue uninterruptedly from 8 to 
52 weeks. 

These measurements have already been reported cross-sectionally 
(1); that is, the overall mean and standard deviation for each age 
period was given, to serve as standards for size of other infants in 
similar or different circumstances. Changes in body proportions during 
this first year have also been discussed (2). The specifically longi- 
tudinal aspect of the data, however, has not yet been taken advantage 
of, and the actual raw figures have never been published. This paper 
repairs both these omissions. 

The data in this study were collected in connection with the Yale 
University Clinic of Child Development Normative Study of Infancy, 
Dr. Arnold Gesell, Director, with whose permission this article is 
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published. The analysis was suggested by Dr. J. M. Tanner and the 
statistics compiled by Dr. Tanner and his assistant, Mr. R. H. White- 
house, to whom the author expresses her appreciation. 

In Tables 1 and 2 are given the means and standard deviations of 
the increments of growth over four-week periods. These figures are 
calculated from the actual individual increments; they are not the 
same as those obtained by subtracting one cross-sectional mean from the 
next one. This point is elaborated and discussed in the accompanying 
paper by Tanner on the reporting of growth data (3). The number 
of children for each incremental period is distressingly small, but better, 
after all, than nothing; and no incremental data of this sort except for 
length and weight have yet appeared. The work-up of this sort of data 
should normally include the correlations between all the increments, 
but here the numbers are much too small to warrant working them out. 
Likewise no individual curves have been fitted, as very few children 
have long.enough records for satisfactory estimates either of form or 
constants to be made. It should also be noted that ideally for mean 
increments such as these to be valid standards, the individual monthly 
increments should be evenly distributed over the months in the year; 
i.e. the 8 to 12 week increment should have some January increments, 
some February and so on, since increments vary with the season. The 
present data are too few for the seasonal effect to be taken into account. 

The remaining tables (Tables 3 and 4) give the raw data; no others 
appear to have been published for this age with the exception of 
Davenport’s (4), which are slightly smoothed and thus depart some- 
what from the raw state. In these tables a few figures which seem to 
be clearly erroneously recorded are enclosed in parentheses: these were 
deleted in the calculations for Tables 1 and 2. Both the material and 
the methods of measurement have been fully set forth previously (1). 
Only the salient points are recapitulated here. 


MATERIAL 


The records for 32 male and 36 female infants are sufficiently exten- 
sive to warrant recording; the number of attendances for these children 
varies from 3 to 24. They were from homes of average socio-economic 
status in the New Haven area. All the parents were American-born, 


the majority of the grandparents also. The infants were all born at 
full term, were healthy, and developed normally in their behavior. 
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INFANT GROWTH DATA 


TABLE 3 
Raw data on males 





VERTEX 
HEIGHT 


STERNAL PHYSIS 
HEIGHT HEIGHT 


BIA- BI- 
CROMIAL CHEST CHEST CRISTAL 
DIAM- BREADTH CIRCUM. DIAM- 

ETER ETER 


SYM- 
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23.5 13.6 11.7 38.1 10.2 
25.1 14.8 12.9 40.2 10.9 
25.6 14.6 13.1 41.1 10.9 
26.7 14.8 13.3 41.8 11.5 


20.7 13.2 9.2 34.0 11.2 
21.6 12.6 9.9 35.6 11.2 
23.7 13.2 12.2 38.8 10.7 
24.9 14.8 11.9 38.2 10.4 
26.2 15.5 12.2 38.8 10.9 
26.8 15.6 12.7 40.4 11.4 
27.2 15.7 13.1 40.7 11.6 
28.1 16.2 13.5 41.0 11.9 
28.2 15.9 13.8 41.9 11.9 
28.7 16.8 14.2 43.5 12.1 
30.3 16.7 14.2 44.0 12.2 
30.9 16.9 14.3 44.1 12.3 
31.9 17.3 14.9 44.9 12.6 
32.7 17.2 15.1 45.5 12.8 
33.2 17.7 15.4 45.3 13.2 
36.2 18.7 15.8 48.7 13.9 
43.6 20.1 _— _— 16.1 
47.6 21.6 16.0 49.2 16.8 
51.1 21.7 16.8 49.6 16.4 
53.0 22.8 17.0 52.8 17.9 
56.2 23.8 17.5 53.3 18.4 
58.1 24.5 18.7 54.5 18.9 
60.0 24.3 18.3 55.1 19.2 

_ 24.4 — _— 19.5 
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TABLE 3—Continued 


Raw data on males 

















BIA- 
SUPRA- SYM- 
VERTEX CROMIAL CHEST CHEST CRISTAL 
AGE HEIGHT ar PHYSIS “DIAM- BREADTH CIRCUM. WEIGHT 
GHT HEIGHT 
ETER 

Subject M.3 
8 wks. 57.4 42.0 22.5 12.4 -- 37.6 9.5 5.10 
12 wks. 63.0 48.0 24.5 18.2 11.2 38.8 11.8 5.92 
16 wks. 62.9 47.3 24.6 15.8 13.4 44.5 11.6 6.87 
20 wks. 64.8 48.3 25.0 16.6 13.6 44.5 5 Bey 7.47 
24 wks. 67.2 50.4 26.3 16.1 13.7 46.7 11.8 8.29 
28 wks. 68.2 51.4 26.7 16.6 14.2 46.9 11.9 8.46 
32 wks. 69.5 52.4 28.7 16.9 14.8 47.3 12.3 8.92 
36 wks. 70.4 52.7 28.3 16.6 14.5 47.2 12.2 9.33 
40 wks. 71.7 53.4 29.7 16.8 14.8 47.9 fly 9.61 
44 wks. 72.0 54.2 31.4 17.2 14.8 47.4 12.3 9.80 
48 wks. 73.9 55.4 31.3 17.5 15.7 47.6 33..7 10.10 
56 wks. 76.7 57.7 32.1 18.1 15.8 49.1 13.1 10.60 
80 wks. 81.8 62.8 35.7 17.8 16.7 50.7 13.6 11.84 
2 yrs. 83.4 65.3 39.9 19.7 16.8 51.8 14.8 12.77 
3 yrs. 93.6 71.5 41.4 20.3 — — 16.6 15.52 
4 yrs. 102.2 79.4 48.0 22.1 18.5 56.2 17.0 17.50 
5 yrs. 108.1 84.5 52.1 23.7 18.8 56.7 17.6 18.94 
6 yrs. 115.1 90.6 56.3 24.9 — 58.5 18.6 20.46 
7 yrs. 121.7 96.7 60.7 26.0 — 60.5 19.6 23.13 
8 yrs. 128.3 102.7 63.9 26.2 —- 61.8 20.4 26.79 
10 yrs. 138.0 112.4 70.8 27.0 — 65.4 21.6 31.10 

Subject M.4 
16wks. 62.0 465 240 132 11 404 96 5.82 
20wks. 63.5 46.7 242 145 121 421 10.2 6.34 
24 wks. 64.9 48.0 25.0 15.1 12.1 42.8 10.3 7.15 
28 wks. 66.0 49.6 26.3 — 12.5 42.3 10.6 7.40 
40 wks. 69.3 51.9 27.9 15.6 — 45.1 11.1 8.46 
44 wks. 69.9 §2.3 27.9 16.5 13.8 47.4 11.6 9.14 
48 wks 71.7 54.5 29.7 1 ig Pe 14.5 47.9 11.8 9.69 
52 wks 73.6 55.3 29.9 17.7 15.2 47.8 12.4 10.21 
56 wks 74.4 56.1 30.0 17.8 15.2 48.6 12.4 10.46 

Subject M.5 
16 wks. 65.6 48.9 27.6 14.4 11.6 42.6 10.9 1.8 7.37 
20 wks. 68.2 §1.2 28.5 14.7 12.6 44.6 10.5 3.2 8.17 
24 wks. 71.2 52.9 29.8 15.4 12.3 47.2 10.7 4.0 8.99 
40 wks. 77.1 57.2 33.7 16.6 14.6 46.0 11.8 6.3 10.85 
44 wks. 77.9 58.9 33.4 17.7 14.4 50.7 12.0 6.8 11.66 
48 wks. 79.6 59.4 35.0 17.6 14.5 51.0 12.4 7.5 11.73 
52 wks. 81.0 61.0 35.6 18.1 14.8 49.6 12.1 8.0 12.15 
56 wks. 81.9 62.0 36.3 17.8 15.9 51.6 12.9 8.3 12.72 

Subject M.6 
16 wks. 64.3 48.2 25.1 14.4 11.6 41.4 9.8 6.63 
20 wks. 66.1 49.6 26.2 15.1 12.6 41.3 10.2 7.14 
24 wks. 68.0 50.3 > ee 15.3 12.6 44.2 10.6 7.84 
36 wks. 72.6 53.9 31.1 16.4 12.7 43.6 11.6 8.49 
40 wks. 72.7 §4.2 30.5 16.7 13.1 45.6 11.6 8.93 
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INFANT GROWTH DATA 





TABLE 3—Continued 
Raw data on males 





CROMIAL CHEST CRISTAL HEAD 


VERTEX 


AGE 


STERNAL PHYSIS 


HEIGHT HEIGHT BREADTH CIRCUM. 


HEIGHT 





Subject M.7 


16 wks. 
20 wks. 
36 wks. 
40 wks. 
44 wks. 
48 wks. 
52 wks. 


Subject M.8 


16 wks. 
32 wks. 
36 wks. 
44 wks. 
48 wks. 
52 wks. 


Subject M.9 


16 wks. 
20 wks. 
24 wks. 
28 wks. 
32 wks. 
48 wks. 
52 wks. 


Subject M.10 


28 wks. 
40 wks. 
44 wks. 
52 wks. 


Subject M.11 
16 wks. 
20 wks. 
24 wks. 
40 wks. 
48 wks. 


Subject M.12 


20 wks. 
24 wks. 
28 wks. 
32 wks. 
52 wks. 
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TABLE 3—Continued 
Raw data on maies 
BIA- BI- 
SUPRA- SYM- 
VERTEX CROMIAL CHEST CHEST CRISTAL HEAD |. 
AGE HEIGHT yon nsng aenane DIAM- BREADTH CIRCUM. DIAM- circum. W=IGHr 
ETER ETER 
Subject M.13 
20 wks. 66.5 49.6 27.3 15.7 13.6 44.5 11.6 42.5 8.25 
24 wks. 67.9 50.0 26.8 15.1 13.3 45.6 12.2 43.2 8.53 
28 wks. 69.0 51.5 27.6 15.4 13.9 = 11.9 43.2 8.70 
36 wks. 71.6 53.8 28.8 a a — — — 9.41 
40 wks. 72.5 53.5 29.6 17.4 14.2 46.4 12.6 45.5 9.25 
44 wks. 73.7 55.9 30.4 17.1 14.6 46.9 12.5 45.7 10.22 
48 wks. 75.6 56.7 31.6 18.1 14.2 46.3 12.8 46.5 10.00 
52 wks. 77.2 58.0 32.2 17.9 14.6 46.6 12.4 46.9 10.44 
Subject M.14 
28 wks. 69.6 52.4 27.8 15.4 13.1 44.6 11.4 43.8 8.61 
32 wks. 70.6 53.3 29.1 15.3 13.2 44.5 11.6 44.5 8.73 
40 wks. 72.2 54.4 30.3 17.3 13.2 46.3 12.6 45.5 9,20 
48 wks. 74.9 56.8 31.7 16.8 13.6 48.9 12.4 46.4 10.24 
52 wks. 76.5 58.5 32.4 os 13.5 = 12.3 46.6 10.51 
Subject M.15 
16 wks. 62.5 45.4 24.3 14.8 12.2 41.4 11.7 41.6 5.95 
20 wks. 64.5 46.9 25.7 15.3 12.7 42.4 11.9 42.6 7.18 
24 wks. 66.3 48.7 26.2 16.6 13.2 44.5 11.7 43.7 7.69 
28 wks. 67.5 50.0 26.4 16.1 13.9 46.0 11.7 44.0 8.21 
52 wks. 74.9 55.1 30.5 17.9 15.7 49.5 12.5 47.5 10.67 
Subject M.16 
28 wks. 64.7 48.0 25.7 15.7 14.4 47.5 12.2 44.9 8.46 
32 wks. 66.8 49.5 27.5 16.2 15.4 49.5 12.4 46.0 8.93 
36 wks. 67.7 50.2 27.5 16.8 15.4 49.4 12.4 46.6 9.50 | 
40 wks. 69.0 50.5 27.9 16.9 15.5 §1.2 12.4 47.2 9.86 
44 wks. 69.5 51.2 28.9 17.4 — — 13.3 47.8 10.55 
48 wks. 70.5 53.4 30.0 Ly & 16.1 51.8 13.3 48.2 10.96 
52 wks. 72.5 53.4 30.1 17.9 16.1 51.1 13.4 48.2 11.16 
Subject M.17 
32wks. 679 49.7 262 148 126 444 111 464 818 
36 wks. 68.7 50.5 26.8 15.6 13.2 11.5 46.6 8.52 
40 wks. 69.7 50.6 28.0 15.1 13.4 46.5 11.4 47.6 9.17 
44 wks. 70.7 §2.1 28.2 15.7 13.6 45.6 11.7 48.0 9.12 
48 wks. 71.6 52.7 28.5 16.4 13.9 44.9 11.8 48.2 9.25 
52 wks. 72.6 53.6 28.8 16.6 14.1 46.4 12.2 48.4 9.32 
Subject M.18 
28 wks. 67.1 50.6 26.2 15.1 13.7 42.6 10.9 43.3 7.56 
36 wks. 70.1 52.4 27.9 16.1 14.4 44.2 11.2 44.3 8.40 
40 wks. 3 53.0 28.5 17.0 14.2 46.5 11.9 44.5 8.74 
44 wks. 71.4 53.7 29.2 17.5 14.4 46.6 12.2 44.7 9.44 
48 wks. 73.3 55.3 29.9 17.8 14.2 47.0 11.9 45.0 9.41 
52 wks. 75.3 58.5 31.4 18.0 14.8 48.2 12.1 45.5 9.62 
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Raw data on males 
BIA- BI- 
SUPRA- SYM- 
VERTEX CROMIAL CHEST CHEST CRISTAL HEAD 
sheneee — seo itor DIAM- BREADTH CIRCUM. DIAM- ciRcuM. WEIGHT 
ETER ETER 
Subject M.19 
8.25 28 wks 65.6 47.9 26.7 14.4 11.7 42.7 10.1 43.2 7.09 
8.53 32wks. 67.2 49.7 27.9 — 123 426 108 443 8.07 
8.70 36wks. 680 506 288 153 180 482 111 £4448 8.37 
9.41 40 wks 69.4 51.1 28.8 14.5 13.0 44.2 11.5 45.5 8.83 
925 44wks. 706 523 295 159 135 456 111 £445.7 8.95 
on , 8 wks 70.9 53.0 30.7 15.4 13.9 44.9 12.2 46.1 9.17 
10.44 | subject M.20 
$2 wks. 70.1 52.3 27.9 17.2 15.0 47.7 12.0 44.5 9.26 
36 wks. 71.1 52.7 30.0 17.5 15.0 47.0 12.0 45.0 9.82 
8.61 44wks. 73.6 54.6 308 17.7 415.5 496 128 469 10.23 
8.73 48 wks. 75.0 55.7 31.6 18.2 16.3 50.6 13.1 47.4 10.64 
aa 52 wks. 76.5 57.8 32.3 18.5 16.7 50.6 13.3 47.7 11.11 
10.51 | Subject M.21 
8 wks. 57.2 41.2 21.5 1 12.2 36.9 38.2 4.64 
12wks. 582 428 22.4 12.7 — 38.8 105 398 5.20 
5.95 16 wks 61.7 44.3 24.9 13.9 13.7 39.5 10.8 41.0 5.86 
a 56 wks 75.3 56.0 31.6 17.4 15.5 45.7 12.6 46.2 _ 
821 Subject M.22 
L0.67 8wks. 615 45.5 236 128 120 402 95 40.2 5.54 
12 wks. 64.5 47.2 25.5 14.3 12.6 41.2 9.7 42.0 6.64 
16wks. 67.1 492 274 148 12.7 416 104 43.0 7.64 
8.46 , 6 wks. 82.3 6 35.4 18.7 15.9 13.4 49.4 12.64 
by Subject M 23. 
058 Swks. 60.6 — 236 181 11.7 358 9.7 392 4.58 
10.96 12 wks. 62.2 45.9 24.6 15.0 13.0 38.6 10.6 39.9 5.19 
1g | Wewks. 65.2 474 260 142 1380 396 106 414 6.20 
; 56 wks. 80.6 60.2 33.8 18.4 15.7 49.0 13.2 46.7 10.82 
852 | 4dwks. 546 410 216 116 108 35.0 88 3878 3.90 
9.17 6wks. 563 41.8 219 13.1 10.7 35.5 90 39.0 4.28 
9.12 8wks. 582 432 226 132 113 ° 364 96 39.8 4.60 
9.25 12wks. 60.8 45.0 243 13.9 122 88.1 98 40.9 6.15 
932 16wks. 61.5 454 25.2 1442 124 3874 10.2 410 5.48 
56 wks. 76.2 57.1 82. 17.6 15.1 47.4 12.1 46.4 9.10 
7.56 Subject M.25 
840) Gwks. 53.5 39.9 208 126 10.7 347 94 369 4.10 
8.74 8wks. 55.2 414 216 #4130 #£«©9411.4 #356 98 37.5 4.35 
9.44 I2wks. 58.2 42.7 226 124 114 #49374 #103 #389 65.10 
9.41 I6wks. 61.3 453 239 133 122 393 #108 40.1 °&#£65.91 
9.62 56 wks. 77.1 58.6 32.4 17.2 16.4 13.3 46.4 11.09 
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Raw data on males 

















BIA- BI- 
SUPRA- SYM- 
VERTEX CROMIAL CHEST CHEST CRISTAL HEAD 
AGE HEIGHT yoo nore DIAM- BREADTH CIRCUM. DIAM- CIRCUM. WEIGH =—— 
ETER ETER | 

— ( AG 

Subject M.26 
Swks. 603 45.2 25.6 14.6 12.0 40.4 10.0 40.6 6%) —— 
12 wks. 64.4 47.7 26.8 14.0 12.6 41.0 10.7 42.0 6.19 gubjec 
16 wks. 67.4 50.9 am 15.4 13.2 43.6 10.7 43.4 748 “iow 
56 wks. 83.6 63.8 37.4 19.7 16.9 48.7 14.0 48.7 12.88 | ig y 
Subject M.27 5a 
36 wks. 70.3 51.9 292 14.7 12.9 42.5 11.5 463 810 a i 
40 wks. 72.2 53.1 30.2 16.0 13.4 45.3 11.8 47.0 852 | 99 y 
44wks. 73.8 54.0 30.2 16.1 13.6 45.0 12.1 47.4 888 96h 
52 wks. 76.5 57.4 31.6 16.4 14.7 45.4 12.3 47.7 9M) toy 
Subject M.28 2. 
32 wks. 70.7 52.1 28.4 15.6 13.5 44.0 11.2 44.0 7.68 fy 
36 wks. 714 528 292 15.7 13.7 44.5 11.5 448 72) ge 
40 wks. 72.7 54.0 29.3 17.0 14.4 49.1 11.6 452 8%) my 
44wks. 74.8 55.6 30.3 17.2 14.7 49.2 11.7 45.6 912) ‘oy 
Subject M.29 2 y 
Swks. 586 43.8 216 13.3 11.7 40.4 96 38.8 5.60 By 
12 wks. 61.4 46.0 23.4 14.2 12.5 40.6 10.6 40.1 625) gy 
16 wks. 63.6 47.2 24.3 13.6 12.7 43.2 10.9 40.9 698) 74 
Subject M.30 ; y 
8wks. 574 412 21.7 118 106 36.2 8.6 40.1 10y 
12 wks. 61.2 45.2 23.2 11.6 11.2 36.6 9.8 40.7 a | 
16 wks. 63.7 46.4 24.7 13.4 12.2 38.9 10.4 (38.7) = | Subje 
Subject M.31 F- 
Swks. 55.2 40.4 21.3 12.6 11.4 37.4 96 (36.2) 50) i¢y 
12 wks. 58.2 42.5 21.5 13.6 13.2 40.8 10.7 402 59} oy 
16 wks. 62.2 44.8 24.8 13.4 13.2 a 116 418 681) oy 
Subject M.32 te 
24 wks. 67.0 49.4 25.8 15.8 12.3 43.4 11.4 42.0 788} 960 
28 wks. 68.3 50.1 26.5 16.8 12.8 45.1 11.7 (44.5) 82! oe 
32 wks. 69.1 50.7. ~—«- 27.9 16.7 14.0 45.1 12.9 43.7 9M! gy 
48 w 
ee ——at =e 
56 w 
60 w 


ooo oe CODD 
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TABLE 4 
Raw data on females 
BIA- BI- 
SUPRA- SYM- 
VERTEX CROMIAL CHEST CHEST CRISTAL HEAD 
AGE = - HEIGHT ae ae DIAM- BREADTH CIRCUM. DIAM- circum. “#!GHT 
ETER ETER 
Subject F.1 
12 wks. 61.2 45.6 24.6 13.8 12.4 a 10.4 —- 6.10 
16 wks. 63.7 47.7 25.9 15.0 12.9 42.5 10.9 40.6 6.89 
20 wks. 66.1 49.4 28.0 15.8 13.1 44.9 11.1 41.6 7.53 
24 wks 69.2 51.7 28.6 16.2 13.6 45.0 11.5 42.3 7.87 
28 wks 70.0 52.2 29.7 16.5 13.6 44.0 11.6 43.0 7.96 
32 wks. 71.8 52.9 30.4 17.3 14.4 44.4 11.6 43.3 8.40 
36 wks. [2 53.4 30.9 17.6 14.6 45.8 11.9 43.6 8.90 
40 wks. 75.0 57.0 31.1 17.6 14.5 46.8 11.8 44.2 9.54 
44 wks. 76.3 57.1 32.4 17.6 14.5 47.2 12.2 44.5 9.72 
48 wks. 17.6 57.6 33.5 17.9 15.6 47.8 12.6 44.8 9.83 
52 wks. 78.6 58.9 34.5 18.4 15.8 48.6 12.6 45.1 10.38 
60 wks. 81.4 61.7 35.5 18.6 15.8 50.5 13.1 46.0 10.95 
80 wks. 85.2 64.5 38.4 18.2 16.1 (48.8) 13.4 46.1 10.24 
2 yrs. 90.1 68.8 40.8 20.3 16.7 51.4 14.2 46.9 12.12 
3 yrs. 97.8 74.9 46.0 19.4 16.8 51.9 15.6 47.4 — 
4 yrs. 105.2 82.3 50.0 22.4 17.4 53.8 16.5 48.5 16.72 
5 yrs. 111.8 87.8 54.4 23.9 17.9 53.7 17.3 48.5 18.14 
6 yrs. 117.6 93.0 59.9 24.6 18.8 54.5 18.1 49.0 19.94 
7 yrs. 124.3 99.1 62.9 26.3 19.3 a 18.9 a 22.31 
8 yrs. 128.3 102.5 65.1 27.0 19.7 58.2 19.4 49.5 23.4 
9 yrs. 132.1 105.9 67.9 27.4 20.2 59.0 20.4 50.0 25.11 
10 yrs. 136.1 108.9 = 28.7 20.7 — — 50.2 26.36 
Subject F.2 
8 wks. 59.7 43.5 23.5 12.7 12.0 35.8 9.4 39.5 5.16 
12 wks. 61.7 — — 13.4 12.6 38.4 9.8 40.6 5.65 
16 wks. 64.4 47.4 25.5 14.0 13.4 39.8 10.4 41.3 6.49 
20 wks 67.6 50.3 26.8 14.8 a= 42.1 10.6 42.7 6.81 
24 wks 68.3 50.9 a2 16.2 13.8 43.0 11.1 43.2 7.54 
28 wks 70.0 51.8 29.0 16.0 14.2 43.5 11.5 44.4 8.08 
32 wks. 71.2 52.5 29.5 16.2 14.5 44.7 11.6 45.7 8.85 
36 wks. 72.1 54.8 29.8 16.8 14.7 47.8 11.9 46.4 9.24 
40 wks. 73.5 55.0 30.6 16.8 15.3 48.1 12.2 46.6 9.63 
44 wks. 74.6 55.8 31.7 16.9 15.6 49.9 12.3 47.2 9.72 
48 wks. 76.2 56.5 32.0 17.8 16.4 49.3 12.6 47.3 10.64 
52 wks. 76.5 57.6 32.8 18.6 16.3 (47.6) 12.6 47.9 10.19 
56 wks. 78.2 59.2 34.2 18.7 15.9 49.6 12.8 48.4 10.97 
60 wks. 79.2 59.6 34.3 18.8 16.6 51.4 12.8 48.3 11.06 
2 yrs. 88.4 67.4 39.9 20.6 18.3 51.1 14.2 50.4 12.46 
3 yrs. 95.8 73.9 45.4 21.3 — 53.8 15.3 50.8 13.98 
4 yrs. 104.9 81.0 51.1 23.0 18.6 56.0 17.0 51.4 17.54 
5 yrs. 110.0 87.5 54.9 24.1 19.4 57.4 17.8 52.0 19.30 
6 yrs. 119.2 94.2 60.2 26.3 19.5 58.8 18.5 51.4 20.82 
7 yrs. 124.2 98.5 63.3 27.7 20.3 60.8 (17.3) 51.5 23.96 
8 yrs. 131.5 105.4 67.6 28.7 21.5 63.0 20. §2.2 28.07 
9 yrs. 136.6 110.1 71.1 29.7 22.4 65.8 20.7 53.8 31.36 
10 yrs. 142.5 115.8 75.3 30.4 23.3 73.9 22.4 55.2 42.50 
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TABLE 4—Continued 
Raw data on females 
VERTEX _SUPRA- SYM- onan. CHEST CHEST CRISTAL HEAD 
AGE HEIGHT ye ng PHYSIS “DIAM- BREADTH CIRCUM. WEIGHT 
EIGHT HEIGHT 
ETER 
Subject F.3 
12 wks 60.0 = 24.3 12.3 12.2 36.4 10.2 5.02 
16 wks 61.7 46.2 25.7 14.4 11.3 39.0 9.7 5.22 
20 wks 63.4 47.9 26.6 14.2 12.6 41.4 10.4 6.06 
24 wks 65.5 48.5 28.1 14.6 36.7 41.2 10.4 6.29 
28 wks 67.0 50.0 28.0 15.1 12.6 41.1 10.5 6.61 
32 wks. 67.7 51.0 28.8 15.7 13.2 43.6 10.7 6.96 
36 wks 69.2 51.5 29.8 15.7 13.4 42.8 10.8 7.03 
40 wks 71.1 53.2 $1.2 16.0 13.6 42.6 12.0 7.37 
44 wks. 71.4 53.5 31.3 16.3 13.9 42.5 12.0 7.57 
48 wks 73.4 56.0 32.0 16.9 14.0 44.1 11.5 V.17 
52 wks. 75.0 56.6 32.8 17.0 14.3 45.0 11.8 8.36 
56 wks. 76.2 58.1 33.9 17.2 14.6 45.3 11.8 8.59 
60 wks. 77.5 58.7 34.6 17.7 14.6 46.0 Ig 8.81 
80 wks. 82.0 62.3 37.0 18.2 14.9 45.8 12.7 9.00 
2 yrs. 85.3 65.7 39.3 18.7 15.9 46.8 13.4 9.67 
3 yrs. 95.1 73.8 45.4 20.3 16.3 49.0 15.1 12.19 
4 yrs. 103.2 80.9 50.5 7:3 16.6 51.4 16.0 14.21 
5 yrs. 111.0 87.7 56.2 22.8 17.1 52.0 16.6 15.64 
6 yrs. 118.6 94.5 60.6 24.5 17.8 53.6 17.3 17.62 
7 yrs. 123.9 99.1 64.2 25.4 18.5 54.5 18.3 19.90 
8 yrs. 129.8 104.5 68.5 25.7 19.2 57.4 19.0 22.61 
9 yrs. 134.5 108.9 70.9 27.8 19.8 58.5 19.6 25.00 
10 yrs. 139.7 113.8 74.9 26.5 20.4 62.9 20.6 28.41 
Subject F.4 
6 wks. 52.2 38.7 20.6 12.4 11.2 34.2 9.2 3.80 
8 wks. 54.7 41.0 22.1 3 11.8 = 8.6 4.00 
12 wks 56.6 42.1 22.8 12.4 12.2 38.6 9.2 4.73 
16 wks. 58.4 43.5 24.1 13.4 12.4 40.3 10.2 5.26 
20 wks. 60.5 45.0 24.2 13.8 13.2 41.6 10.6 5.94 
24 wks 62.3 46.3 25.1 14.2 13.4 42.2 10.7 _ 
28 wks 62.1 46.5 26.2 14.3 13.4 43.9 11.0 6.87 
32 wks. 64.3 47.5 26.8 14.5 13.5 44.6 11.0 7.54 
36 wks 66.8 49.9 27.6 14.9 14.2 46.4 11.2 7.92 
40 wks 67.7 50.7 27.9 15.6 14.3 47.3 11.6 8.42 
44 wks 69.6 52.4 29.1 16.7 14.6 47.4 11.7 8.48 
48 wks 68.9 51.7 28.7 16.6 15.0 48.9 11.8 9.14 
52 wks. — = — — — 49.9 — a 
56 wks. 71.7 54.0 30.9 17.5 15.0 52.4 12.3 10.14 
60 wks 73.5 55.3 $2.1 18.4 16.1 51.0 12.5 10.45 
80 wks 78.8 60.1 — 19.2 16.8 51.4 13.3 11.33 
2 yrs. 83.2 64.5 38.8 20.0 17.3 52.7 14.2 12.93 
3 yrs. 91.8 71.2 43.6 21.1 17.4 54.2 15.5 14.49 
4 yrs. 99.1 78.4 48.3 22.8 18.1 55.8 16.7 16.26 
5 yrs. 106.2 84.5 52.7 24.4 19.0 57.7 17.9 18.92 
6 yrs. 112.1 89.5 56.6 25.7 19.7 58.7 18.7 21.01 
7 yrs. 117.4 94.4 59.4 27.1 20.2 63.0 19.4 25.26 
8 yrs. 123.1 99.9 63.6 28.2 21.3 65.9 20.5 29.10 
9 yrs. 127.5 103.2 66.5 28.4 22.2 66.5 20.6 30.87 
10 yrs. 132.6 108.0 69.9 29.6 22.4 71.2 22.1 34.19 
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Raw data on females 
BIA- BI- 
, VERTEX osname, oan CROMIAL CHEST CHEST CRISTAL 
AG HEIGHT “yeIGHT HEIGHT DIAM- BREADTH CIRCUM. DIAM- 
ETER ETER 

Subject F.5 

16 wks. 60.8 44.6 23.0 12.3 11.0 39.4 10.1 6.15 
20 wks. 62.6 47.0 24.1 13.2 10.9 42.0 10.3 7.16 
28 wks. 65.8 48.4 25.7 13.8 11.7 44.3 10.9 8.22 
$2 wks. 68.7 50.8 26.7 14.8 12.5 44.1 11.2 8.46 
44 wks. 71.2 53.3 30.4 15.9 12.7 46.1 12.1 9.20 
48 wks. 72.2 54.3 30.7 15.2 12.6 46.9 12.1 9.54 
52 wks. 73.2 54.3 30.3 15.5 — 47.4 11.4 9.66 
56 wks. 74.1 55.1 32.3 16.1 14.2 47.8 12.1 9.96 
Subject F.6 

16 wks. 60.1 44.4 24.6 12.6 11.5 39.9 10.1 

20 wks 62.4 46.2 25.4 13.7 12.0 40.5 — 

28 wks. 65.4 48.5 27.1 14.4 12.6 42.0 10.4 

40 wks. 69.9 §2.1 26.6 15.6 14.2 44.2 11.5 

48 wks. 72.3 53.4 29.8 15.9 14.1 46.3 11.8 

52 wks. 73.0 54.6 29.9 16.1 14.6 46.5 11.2 
Subject F.7 

20 wks. 62.6 45.9 24.9 13.2 11.6 39.9 10.1 6.29 
24 wks 63.7 47.6 25.7 13.6 12.3 40.8 10.4 6.55 
86 wks. 67.4 49.0 27.5 15.1 11.7 44.5 10.9 7.57 
44 wks. 69.7 51.6 28.0 14.8 12.9 44.6 12.4 8.60 
48 wks. 70.8 52.5 29.2 16.3 12.7 45.5 11.6 9.01 
52 wks 72.0 53.0 29.6 15.4 13.2 45.2 11.6 8.85 
S6wks. 72.7 653.6 30.7 162 181 45.0 12.4 9.20 
Subject F.8 

16 wks. 61.6 45.7 24.4 13.9 11.8 43.1 10.5 

20 wks. 64.5 47.9 26.0 14.8 12.2 42.1 10.6 

36 wks. 69.4 51.4 29.3 14.6 (14.2) 44.2 11.8 

40 wks. 70.4 52.0 30.1 14.8 13.2 46.4 12.1 

52 wks. 73.8 55.1 31.1 15.6 13.6 45.1 12.4 
Subject F.9 

16 wks. 60.4 45.4 24.0 13.8 12.6 40.4 10.5 

20 wks. 62.4 46.8 25.8 13.7 12.4 42.4 10.6 

24 wks. 64.7 48.6 26.5 14.1 12.1 43.4 10.6 

28 wks. 65.9 49.4 27.1 14.3 13.1 44.1 11.0 

48 wks. 70.8 51.5 29.9 16.1 14.1 46.6 12.1 

52 wks. 71.6 54.6 30.7 16.7 14.6 47.9 12.2 
Subject F.10 

20 wks. 62.9 46.1 25.6 13.5 11.6 39.5 9.2 

24 wks. 64.9 48.4 26.2 - 11.4 41.0 10.4 

28 wks. 65.8 48.2 28.5 14.3 12.4 41.6 10.7 

44 wks. 70.7 51.9 29.3 14.8 12.4 41.5 10.3 

48 wks. 71.8 _- 30.0 15.8 13.4 42.3 11.2 

52 wks. 72.4 54.3 30.5 15.8 12.7 43.5 11.6 
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TABLE 4—Continued 
Raw data on females 
Re Se SS CROMIAL CHEST CHEST CRISTAL HEAD 
AGE HEIGHT STERNAL PHYSIS 
T HEIGHT HEIGHT DIAM- BREADTH CIRCUM. CIRCUM. 
ETER 
Subject F.11 
24 wks. 66.0 48.9 26.8 14.4 12.8 40.1 10.9 41.6 6.64 
28 wks. 67.0 49.9 27.9 15.0 13.4 41.9 11.3 41.9 6.73 
32 wks. 69.3 51.4 27.9 15.2 13.7 43.2 11.5 42.3 7.40 
36 wks. 69.4 51.5 29.3 16.7 14.1 43.8 11.8 42.7 7.48 
40 wks. 71.2 52.7 29.7 16.9 14.1 43.9 12.2 43.0 7.77 
44 wks. 71.8 52.8 30.1 17.4 14.2 43.9 12.3 43.4 7.82 
48 wks. 73.1 54.2 30.5 17.5 14.4 45.5 12.6 43.7 8.36 
52 wks. 74.1 55.0 31.3 17.5 14.2 44.4 12.6 43.7 8.29 
Subject F.12 
24 wks. 62.3 45.6 25.6 14.8 13.7 43.2 11.5 41.9 7.49 
28 wks. 65.2 47.7 25.6 15.2 13.6 45.9 11.8 42.5 8.00 
32 wks. 66.2 48.5 25.9 15.3 13.6 44.5 11.9 43.2 8.46 
36 wks. 67.2 49.7 26.7 15.7 14.1 47.9 12.1 44.1 8.86 
40 wks. 68.7 50.6 27.9 16.7 14.6 48.0 12.2 44.9 9.20 
44 wks. 70.0 51.6 28.6 17.1 14.7 48.5 12.4 45.3 9.49 
52 wks. 72.5 54.7 30.2 17.1 14.8 49.0 12.4 46.0 . 
56 wks. 73.8 56.3 30.8 18.1 15.1 49.1 12.5 46.3 
80 wks. 80.6 61.1 34.5 17.9 15.9 50.1 13.5 47.4 
2 yrs. 85.6 65.4 38.2 20.3 16.5 50.8 14.7 48.4 
Subject F.13 
32 wks. 69.6 52.8 29.2 15.7 14 42.3 11.6 44.2 7.5 
36 wks. 71.1 52.7 29.9 16.6 — 44.4 12.2 44.9 7.8 
40 wks. 72.5 54.2 30.6 17.2 14.6 44.8 12.5 45.3 8.2 
44 wks. 74.2 55.7 30.8 17.7 15.3 45.1 12.5 45.6 8.6 
48 wks. 75.0 56.2 32.1 18.2 15.6 45.8 12.5 46.0 8.83 
52 wks. 76.6 57.5 32.8 18.4 15.8 46.6 12.8 46.2 9.30 
56 wks. 77.9 58.5 33.3 18.2 15.2 46.5 12.8 46.5 9.25 
Subject F.14 
28 wks. 66.0 49.7 27.6 14.6 12.2 41.5 9.8 41.5 6.34 
36 wks. 69.0 52.2 29.1 15 12.7 42.4 10.6 42.4 5.80 
40 wks. 70.8 53.1 30.1 — 12.7 43.4 10.6 42.9 7.18 
44 wks. 70.9 53.8 30.7 15.0 13.2 42.2 6 42.2 6.94 
48 wks. 72.1 54.8 $1.1 15.6 13.2 42.9 11.2 43.4 7.94 
52 wks. 73.0 54.9 32.0 16.0 13.2 44.2 11.7 44.2 7.84 
Subject F.15 
32 wks. 67.9 50.4 27.0 15.8 14.2 43.5 11.1 43.0 7.59 
36 wks. 69.7 51.6 27.6 16.8 14.2 _- 11.7 43.4 8.00 
40 wks. 70.7 53.0 28.5 17.2 15.0 44.2 12.3 . 44.2 8.74 
44 wks. 72.6 54.6 30.1 17.6 14.6 46.4 12.1 44.7 9.04 
48 wks. 73.7 55.4 30.6 18.3 14.6 47.2 12.3 44.9 9.32 
52 wks. 75.8 57.3 31.6 18.8 15.4 47.5 12.4 45.5 9.64 
56 wks. 77.2 57.7 33.1 — == = — _ 10.14 
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TABLE 4—Continued 
} Raw data on females 
BIA- BI- 
—.. SUPRA- SYM- 
VERTEX CROMIAL CHEST CHEST CRISTAL HEAD 
AGE HEIGHT icone ie DIAM- BREADTH CIRCUM. DIAM- circum. W=!GHT 
IGHT ER ETER 
; 
~~ | Subject F.16 
| wks. 59.9 44.2 23.8 13.9 10.8 40.9 8.8 29.9 5.19 
64 o4wks. 62.4 45.9 24.9 14.3 11.9 41.7 10.5 40.8 6.20 
73 98wks. 64.4 47.6 25.8 15.0 12.9 43.6 11.4 42.1 7.08 
40 52wks. 72.5 54.5 31.4 16.5 13.4 46.8 12.6 44.4 8.90 
48 
17 Subject F.17 
82 | wks. 60.7 45.2 24.2 15.4 12.6 — 12.0 40.0 6.26 
36 %wks. 63.2 47.3 25.0 15.5 13.4 42.8 11.1 40.9 6.70 
29 8wks. 63.8 46.4 26.2 (144) 138.38 44.0 10.8 41.7 7.15 
52wks. 71.0 53.3 29.6 16.5 14.2 45.9 12.2 44.9 9.14 
49 Subject F.18 
00 wks. 66.5 49.3 28.7 14.2 11.7 41.2 11.1 42.3 7.08 
-46 8 wks. 66.0 48.4 — 14.3 11.9 42.7 10.7 43.1 7.46 
86 | gwks. 68.5 50.7 81.5 — 12.5 44.5 12.5 44.0 8.06 
y 52wks. 73.5 54.5 — _ ome os a 45.6 8.75 
1) | Subject F.19 
3 Swks. 58.5 42.8 23.4 13.9 11.8 36.8 10.0 37.6 4.48 
3] wks. 61.8 46.0 25.1 13.7 12.2 39.2 10.6 39.0 5.35 
i6wks. 64.4 47.6 25.9 13.2 13.9 40.8 11.2 38.9 6.20 
' 56wks. 80.2 59.5 34.6 19.4 15.8 48.4 13.2 45.4 10.75 
= "Subject F.20 
‘1 | Swks. 57.4 42.4 23.2 12.5 10.8 36.5 10.0 37.8 4.92 
"67. «| «W2wks. 59.4 44.2 24.4 13.5 11.7 37.4 9.8 39.3 5.37 
"93. «| «‘l6wks. 59.8 44.7 25.5 13.0 11.6 38.6 10.2 40.5 5.54 
99 «| S6wks. 75.3 56.0 31.4 17.0 14.7 47.0 12.0 47.2 10.26 
25 
Subject F.21 
Swks. 55.4 41.4 20.0 11.7 11.0 38.5 10.4 38.8 4.70 
34 | wks. 58.0 42.4 22.0 12.8 12.3 38.6 10.2 40.2 5.62 
“39 «| «6 wks. ~= 61.4 45.1 22.5 14.6 12.5 40.0 10.2 41.5 6.17 
"1g | S6wks. 78.7 59.1 32.2 17.7 15.2 48.6 12.9 46.9 10.29 
94 | 
94 | Subject F.22 
84 8wks. 54.1 40.0 20.9 12.4 9.6 32.4 8.9 36.5 3.70 
wks. 55.2 41.8 22.4 11.6 10.8 35.0 9.2 37.5 4.25 
l6wks. 59.4 44.0 23.0 13.7 11.9 38.4 10.1 38.9 5.37 
59 | S6wks. 72.5 54.6 30.3 17.8 14.4 44.2 12.7 43.9 8.72 
00 | 
74} Subject F.23 
a 20wks. 63.7 47.5 25.7 15.7 11.3 40.9 10,8 40.8 6.00 
‘ei | cAWks. © 65.2 48.5 26.1 15.3 11.9 42.9 11.7 42.1 6.76 
4S wkS. 68.0 51.2 27.8 16.3 11.4 44.4 11.1 42.9 7.68 
52,wks. 76.5 57.7 33.5 17.7 une 48.4 13.8 45.9 10.25 
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TABLE 4—Continued 
Raw data on females 
BIA- BI- 
SUPRA- SYM- 
VERTEX CROMIAL CHEST CHEST CRISTAL HEAD 
AGE HEIGHT coo Lined DIAM- BREADTH CIRCUM. DIAM- circum, WElGi 
ETER ETER 
Subject F.24 
8 wks. 54.9 41.0 21.5 12.4 10.6 36.3 9.5 36.7 4.45 
12 wks. 57.7 43.0 22.6 12.8 10.9 37.3 10.2 37.2 5.20 
16 wks. 60.3 44.5 24.2 14.2 12.3 39.7 10.6 38.3 5.42 
56 wks. 76.6 58.7 32.9 17.7 14.8 46.2 12.7 43.5 9.51 
Subject F.25 
32 wks. 64.3 47.3 25.8 14.6 12.7 42.6 10.6 41.9 6.88 
36 wks. 66.1 48.7 26.3 15.0 13.3 42.8 10.8 42.6 7.55 
40 wks. 67.7 50.2 26.6 15.3 13.4 43.9 10.9 42.8 7.72 
44 wks. 68.4 50.5 27.5 15.6 13.5 44.6 11.4 43.5 8.34 
48 wks. 70.2 52.5 28.4 16.7 13.6 45.4 11.6 43.8 8.94 
52 wks. 72.1 54.0 29.1 17.1 13.8 46.8 11.8 44.4 9.33 
Subject F.26 
32 wks. 69.0 52.3 26.1 16.3 12.3 ae 11.2 41.6 7.78 
36 wks. 69.9 52.3 29.5 16.3 13.1 42.4 11.2 42.0 7.9% 
44 wks. 71.8 53.4 29.9 16.6 13.4 43.4 A. 42.9 8.43 
48 wks. 72.9 55.5 30.3 17.2 13.4 44.0 12.3 43.0 8.74 
52 wks. 74.0 56.2 30.5 17.6 13.4 45.0 12.4 43.3 8.72 
Subject F.27 
36 wks. 68.9 51.9 30.1 15.9 13.8 42.8 11.5 42.4 7.17 
40 wks. 70.1 52.3 29.9 16.3 — 41.6 11.6 42.4 7.21 
44 wks. 71.0 52.5 30.3 16.6 14.1 43.5 11.7 42.9 7.61 | 
48 wks. 72.9 54.3 31.2 16.6 14.1 44.6 12.3 42.9 7.92 
52 wks. 73.6 54.8 31.6 17.6 14.6 46.3 12.6 43.7 8.10 
Subject F.28 
20 wks. 64.9 48.1 26.3 16.3 12.2 45.4 11.0 39.9 7.30 
24 wks. 66.7 50.1 ry 15.7 12.9 (43.9) 10.5 40.8 7.87 
28 wks. 68.5 51.5 27.9 16.3 13.4 46.9 10.6 41.0 8.39 
36 wks. 70.9 53.9 29.6 16.2 12.9 46.1 11.3 41.7 8.36 
40 wks. 72.2 54.6 29.8 16.3 13.6 47.6 11.4 42.3 9.02 
52 wks. 75.0 — — — — — — — 9.94 
Subject F.29 
32 wks. 70.6 52.9 28.9 16.1 13.6 44.9 11.1 47.6 9.17 
36 wks. yy wl 53.3 30.1 — 13.6 44.6 11.1 48.5 9.32 
40 wks. 73.1 64.2 —~ 16.8 14.0 45.0 11.4 48.8 9.30 
48 wks. 75.4 56.3 31.6 16.8 14.7 46.4 11.5 49.8 10.10 
Subject F.30 
24 wks. 66.1 48.9 27.1 15.9 12.9 44.6 10.9 42.4 8.06 
28 wks. 67.8 50.9 28.5 16.1 13.7 45.9 11.2 43.3 8.33 
32 wks. 69.0 51.6 29.1 15.8 (12.1) 45.5 11.8 43.6 8.37 
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TABLE 4—Continued 


Raw data on females 














ae 
: ™ BIA- BI- 
WEIGH? VERTEX SusEs- SYM- CROMIAL CHEST CHEST CRISTAL HEAD 
AGE ‘HEIGHT yor pia ine DIAM- BREADTH CIRCUM. DIAM- circum. “"I@HT 
| ETER ETER 
445 Subject F.31 
5.20 8 wks. 60.7 44.3 23.7 14.0 11.0 37.3 10.3 88.3 4.90 
5.42 | 12 wks. 62.8 46.8 25.1 14.0 11.9 38.2 11.0 39.4 5.50 
9.51 16 wks. 65.2 48.0 26.1 14.8 13.6 41.6 11.5 40.1 6.64 
Subject F.32 
24 wks. 67.7 49.8 28.4 15.7 12.4 46.1 11.6 41.3 7.20 
6.88 | 28 wks. 68.4 51.3 28.5 15.3 (18.8) 46.2 11.4 42.0 7.58 
im 32 wks. 70.2 52.7 80.2 15.8 12.5 (44.1) 12.3 42.6 8.08 
8.34 Subject F.33 
8.94 | 8 wks. 58.4 42.2 23.3 13.4 11.6 39.2 9.4 39.4 5.10 
9.33 12 wks. 61.2 46.0 24.6 12.9 10.4 38.6 10.0 40.4 6.08 
16 wks. 64.2 47.2 25.7 13.6 13.2 42.3 10.6 41.9 7.18 
at Subject F.34 
1% 8wks. 654.2 39.2 21.6 10.6 10.5 35.2 8.9 38.6 4.20 
8.43 12 wks. 55.2 39.6 22.4 12.3 10.4 35.9 9.3 39.9 4.64 
8.74 | 16 wks. 57.5 41.7 23.3 12.2 11.6 37.6 9.7 41.2 5.28 
8.72 Subject F.35 
8 wks. 54.0 39.9 20.8 12.7 11.1 36.4 8.9 38.4 4.30 
12 wks. 57.2 42.9 22.0 13.0 12.5 39.0 10.0 38.7 5.20 
HS 16 wks. 59.6 44.3 23.4 12.6 12.3 40.4 10.6 40.2 6.20 
eal 
7.61 Subject F.36 
7.92 40 wks. 69.7 52.5 28.7 15.4 12.8 44.4 11.6 8.54 
8.10 } 44wks. 70.3 53.0 29.1 — 13.8 45.5 11.8 43.5 9.10 
| 48 wks. 72.0 54.1 29.4 17.0 14.3 46.8 12.4 44.1 9.88 
7.30 
7.87 | 
8.39 
8.36 
9.02 
9.94 
i 
9.32 | 
9.30 
10.10 
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MEASUREMENTS 

All measurements except those at 8 and 12 weeks were made by the 
author. Those at 8 and 12 weeks were partly made by the author and 
partly by one other person. 

1. Verter height (soles —verter). The infant is stretched hori- 
zontally on a measuring board with the soles of the feet held in contact 
with one end of the board; the anthropometer arm is brought down from 
the other end till it touches the vertex of the head. 

2. Suprasternal height (soles — suprasternal). The same, with the 
anthropometer arm brought to the suprasternal notch. 

3. Symphysis height (soles — pubes). The same, with anthropo- 
meter arm brought to crease above pubes, or to upper border of symphysis 
pubis. 

4. Biacromial diameter. Taken while the infant still lies on the 
board. The infant must be relaxed and with the shoulders drawn neither 
forward nor back. Spreading calipers. 

5. Chest breadth. Taken similarly at the nipple level; the point 
intermediate between minimum and maximum is taken if the child is 
erying. Spreading calipers. 

6. Chest circumference. Taken at the nipple level. 

7. Bicristal diameter. Taken between the most lateral portions of 
the iliac cress. Spreading calipers. 

8. Head circumference. Taken round the most prominent portion 
of the occiput, and the glabella. 
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SOME NOTES ON THE REPORTING OF 
GROWTH DATA 


BY J. M. TANNER 
Sherrington School of Physiology, 
St. Thomas’s Hospital Medical School, London 


INTRODUCTION 


MONGST students of human growth and development one fre- 
A quently hears discussed the relative advantages of longitudinal 
and cross-sectional growth studies, and the general opinion that both 
have their merits usually secures the confident agreement of all con- 
cerned. It is less often, however, that the precise nature of these merits 
is set forth, with the necessary inferences as to the statistical handling 
of each type of data. Indeed, the need for a comprehensive account of 
the biometrical handling of growth data is glaringly obvious. Its 
satisfaction would certainly be a formidable undertaking, to which 
the present paper makes no pretense. What is given here is little 
more than a series of notes arising from my own difficulties when faced 
with the extraction of information from a mass of measurements. 
But at least they deal with the problem and may stimulate some bio- 
metrically-minded biologist with a mathematical statistician at his elbow 
to attempt a real synthesis of views and methods. 

In one section of the field these notes do approach a complete 
account, at least of the simpler aspects; the appearance of Patterson’s 
paper (34) on sampling on successive occasions with partial replace- 
ment has enabled me to rewrite extensively some sections of Part 1, 
dealing with the reporting of a single measurement, such as stature. 
I am most grateful to him for allowing me to read his manuscript prior 
to publication, and to Mr. M. J. R. Healy, who put me in touch with 
him, and contributed himself a great many helpful criticisms. 

These notes are divided into two parts. Part 1, dealing with the 
single measurement, re-emphasises the importance of reporting longi- 
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tudinal data‘ in a different way from cross-sectional, and discusses the 
technique needed to get the maximum information out of the data 
from mixed longitudinal series, and from a long series of measurements 
on a single individual. It is still commonplace to see mixed longitudinal 
series reported in a purely cross-sectional manner, the mean values at 
age 7, 8, etc., being given; and subsequently in the same article to 
encounter a statement about “rate of growth” estimated as the mean 


9 


at 8 minus the mean at 7.2. Shuttleworth (1) examined this fallacy at 
some length and with the great clarity in 1937, and Wilson (2) later 
dealt with it in more mathematical terms. Yet it seems that Dr. 
Thompson’s paper (3), immediately preceding this one, is the first to 
report actual increments and their variability in a mixed longitudinal 
series for measurements other than height and weight. 

Part 2 deals with the simultaneous reporting of two or more measure- 
ments. The growth of one part of the body relative to another has been 
much written about, and often characterised amongst anthropologists 
by a ratio index, such as trunk length/leg length. Zoologists, on the 
other hand, have more often used the allometry equation introduced by 
Huxley and others, wherein the multiplicative velocities of the two 


*I have used a terminology as follows. There are three types of series: 
»ross-sectional, where each child is measured once only, and all the children at 
age 8 differ from all those at age 7; pure longitudinal, where every child is 
measured at every age, and where all the children at age 8 are the same as the 
children at age 7; mixed longitudinal, where some or all of the children are 
measured at least twice, but where some or all fail to last the entire course from 
entry age to maturity—here some of the children at age 8 are the same as at 
age 7, but some are new entrants. Pure longitudinal series can as a rule only be 
arrived at by taking from a mixed longitudinal series those cards that have been 
successfully completed, and often their number is distressingly small. For this 
reason, mixed longitudinal series have a great importance, and yet it is in the 
reporting of these that confusion seems still to occur. 

*The force of Shuttleworth’s (1) remarks on this topic has not diminished 
during the last decade. “The importance of repeated measurements on the same 
children,” he wrote, “has been urged for so long and by so many as to beome 
the first article of faith among students of child development. Nevertheless, the 
multitude of longitudinal studies which have yielded only cross-sectional findings 
suggest that this faith has been degenerating to the level of pure dogma... . 
In longitudinal studies the emphasis of the analysis should fall on the growth 
increments rather than on gross dimensions. ... Thousands of dollars have been 
spent on the collection of longitudinal data, hundreds of dollars have been spent 
on cross-sectional problems, and only dollars have been devoted to the develop- 
mental aspects of the data.” 
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dimensions are compared. Both these methods are discussed and the 
relations between them outlined. There is a brief allusion to the possi- 
bilities of dealing with more than two measurements simultaneously, 
and finally a few remarks on the relationship between allometry during 
growth and the factor analysis of adult physique. 

Throughout I have endeavoured to present the argument in such a 
way that it is intelligible to the biologist, pediatrician or anthropologist 
who is not a trained biometrician. I have assumed a knowledge of only 
very simple biometric procedures such as are implied by the words 
mean, standard deviation, variance, standard error, regression and cor- 
relation coefficient. Some of the algebraic expressions may be lengthy, 
but this does not mean they are complicated, and in an appendix I have 
given fully worked out numerical examples of all of them. A short 
study of these workings in the appendix should enable a technician 
computer to work out all the various statistics detailed below. 


PART ONE: THE SINGLE MEASUREMENT 


The hypotheses 

The way in which data of a single measurement, such as stature, 
should be reported depends on the use to which they are going to be put. 
Data are collected in order to test hypotheses, and three possible hypo- 
theses are as follows: 

A. That child P is abnormal in stature, i.e. that the chance of P 
belonging to the group G of children of which the standardising data is 
a sample, is very small; 

B. That child P is growing abnormally fast, i.e. that the chance 
of P’s increment between ¢, and ¢, years belonging to the population of 
increments of the group G is very small; 

C. That child P is growing with an abnormal acceleration, or (a 
more probable clinical event) that P’s deceleration from birth onwards 
is too slow; he continues growing at too high a speed. That is, that the 
chance of P’s deceleration from the (t.—t,) period to the (tf; —t.) 
period belonging to the group G’s deceleration over the same period is 
very small. 

We must be careful at the outset to get the exact sense of these 
hypotheses, to distinguish between populations and samples. The 
standardising data are all samples. The various extant data sample 
each somewhat different groups of children, or, in the statistician’s 
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terminology, somewhat different populations. The Brush Foundation 
data sample the population of healthy white Cleveland children of 
North American ancestry and above-average economic status. The Fels 
data sample a somewhat similar population, though perhaps not quite 
so economically restricted; the Iowa data sample is again from a similar 
population, though with a different regional restriction. The Harvard 
School of Public Health group samples a considerably lower economic 
level, and so on. The data of each of these studies may be taken for 
use in a particular problem, and it remains to be shown whether in 
fact some of these samples differ sufficiently for us to consider that 
they come from truly different populations, or could be obtained by 
repeated sampling of the same one. In the statistical sense these samples 
are undoubtedly very poor ones. The children included in the sample, 
or Growth Study, were not selected either at random or by some other 
fixed procedure from the population which they are a part of; they 
were included for accidental reasons, such as willingness to co-operate 
or being personally known to the investigator. Doubtless the estimation 
of population figures from such samples is biased, but we can do little 
but note the fact and pass on. The administrative problems of a 
Growth Study are formidable enough already; nevertheless if the results 
are to be used as standards for child growth in the region, it is desirable 
for new studies to adopt as nearly as possible some recognised sampling 
procedure, and for the older ones to adjust themselves to some set 
pattern by controlling their future admissions. 

In testing hypotheses A, B or C, then, we are concerned with whether 
child P is to be regarded as a member of the population or group of 
which our Growth Study data is an imperfect sample. Thus we must 
assess our child P’s figure against the mean and variability of the 
population, and these two must be estimated from the mean and vari- 
ability of our sample data. 

In testing hypothesis A, in the case of stature, for example, we 
proceed as follows: Presuming the distribution of stature in the popu- 


lation of children is normal or very nearly so, the value for P is sub- 
tracted from the estimated population mean for the group of children 
(say white, North American, above-average economic status, healthy) 
to which he should on the face of things belong. The resulting difference 
is then compared to the estimated standard deviation of the population, 
If the hypothesis is that P is abnormal (either too tall or too short) 
and one wishes to misclassify only 5 children out of 100, then P is said 
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to be abnormal if his difference from the mean is twice (or more 
accurately 1.96 times) the standard deviation. If the hypothesis is 
that P is too short and if one wishes to classify as pathologically short 
(and so probably maltreat medically) only 5 normal children in 100, 
then the level of decision is at a difference of 1.65 times the standard 
deviation. It should be remembered, however, that this second, one- 
sided test is far more sensitive than the two-sided test to non-normality 
in the population distribution. Thus for such measurements as waist 
or hip circumference, which seem mildly skewed, the one-sided test is 
suspect, though the two-sided probably suitable. For more heavily 
skewed distributions, such as weight, even the two-sided test becomes 
questionable, and quite inapplicable in such very highly skewed measure- 
ments as subcutaneous tissue thickness by caliper measurements on 
tissue folds. There are various possible approaches to these skewed 
measurements ; perhaps the simplest would be by finding transformations, 
such as log weight instead of weight, for which distributions will approxi- 
mate normality. Little practical work has yet been done on this, and 
it will take considerable experience of transformed variates before we 
can be sure a particular transformation will be in all cases suitable for 
a particular measurement. Alternatively, statisticians are seeking some 
form of the test which is less sensitive to skewness (see, for example, 
reference 32), though less efficient (that is, discovers fewer pathological 
cases for a given size of standardising sample). Comparison by means 
of percentiles is often resorted to; the trouble here is that this procedure 
is less efficient in the statistical sense than would be the t-test equivalent 
on a transformed variate. The location of the population 95th percentile 
from a sample is done less accurately than the location of the point 
representing twice the standard deviation; so unless the number of 
standardising cases is very large, the point taken as 95th percentile for 
the population may in reality be nearer the 90th or 100th. 

This example is taken from the pediatrician’s viewpoint; the anthro- 
pologist may often be interested in testing the mean figures for one 
whole sample against the mean of another sample or against a population 
mean estimated from another series of samples; for this second situation 
hypotheses A, B and C read equivalently as in the first. 


Hypothesis A 


The hypothesis A is answered entirely by cross-sectional data; if 
one is only interested in the truth or falsity of A, the collection of a 
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longitudinal series would be a ridiculous expense of time and money. 
This remains true even though a series of independent samples for each 
year (the pure cross-sectional technique) is a rather less statistically 
efficient way of estimating the population means at each year than is 
a series of mixed longitudinal data, or, as Yates calls them, partial 
replacement samples. By efficiency in this connection is meant the 
number of subjects needed to give a particular standard error to the 
population mean—in other words, to estimate the population mean with 
a particular degree of accuracy. Yates (31, p. 260) finds that fewer 
subjects would be needed if a mixed longitudinal series were available, 
but not very many fewer. Consequently the cost of prolonging the study 
over two or more years and locating a proportion of the same children 
twice would far outweigh the advantage. 


Estimates of population mean from mired longitudinal series 

If, however, a mixed longitudinal series ts available anyway, collected 
for the purposes of hypotheses B and C, a somewhat improved estimate 
of the population mean at each year can be obtained; an estimate 
improved, that is, over that obtained by treating each year’s figures as 
though purely cross-sectional or independent. Wilson remarked this, 
and Yates, in a definitive monograph on sampling methods published 
shortly after the first draft of the present paper was circulated, carries 
its consideration a good deal further and with considerably greater 
clarity of exposition (31, see particularly pages 45, 175-182, 233-235, 
260-262). The results given in Yates’ book were in the main obtained 
by Patterson who has recently extended them considerably (34). 


Notation 

The problem of a suitable notation is not altogether simple. We 
have endeavored to devise one which will meet the requirements of pre- 
cision and of reasonable simplicity in typesetting, at the cost of some- 
what unfamiliar symbolism: 


N, = total number of subjects at time h, 
m, = mean of all subjects at time h, 
v, = variance of all subjects at time h, 


S, = Vv, =standard deviation of all subjects at time h. 


The next group of symbols relates to subjects present at time h who were 


also present at time (hk —1): 
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Ly, = number of such subjects, 
1, = mean of these subjects at time h, 


V(l,) = variance of these subjects. 
For subjects present at time h but not present at time (kh —1) we use 
Cn, Cr, V (ex) = number, mean, and variance. 


We shall also need to consider subjects present at time A according to 
their presence or absence at time (kh -+-1). These we denote by primed 
symbols, 

Ln, Un, V(Un), and C’n, &n, and V(c',), respectively. 


Note that 
Na=1Iyn+ Cy = L'n + Cr. 


We shall also want 


cov (l’x1, 4) = covariance of measurement of subjects 


present at both times h and (h —1). 


We remark that variances and covariances are calculated with divisor 
n—1 rather than n. 

Let us designate by A the percentage longitudinality from ¢, to ¢2, 
that is, the proportion of children present at time ¢, who were also 
present a year earlier at ¢;. Let » be the percentage cross-sectionality, 
or proportion of children present at ¢, and not present at ¢t,:»—1—A. 
Now if we have only a purely cross-sectional sample at t, from which 
to estimate the population mean for that age, the best estimate actually 
equals the sample mean. But if we have some members at ¢, previously 
present at ¢,, there are two ways in which the ¢, population mean can 
be estimated. It can firstly be estimated by reference to the longi- 
tudinal element only. This is done by taking the longitudinal element 
sample value at ¢, and adding to it the mean increase from ¢, to f, 
obtained by the regression of the longitudinal element ¢, values on their 
corresponding ¢, values. The population mean at f, can secondly be 
estimated from the new values at ¢., the cross-sectional element not 
present at f,. This is done quite directly, as if the longitudinal element 
did not exist. These two estimates of the ¢, mean are independent, and 
the statistically single best estimate of the ¢, mean is obtained by 
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weighting and combining the two. Yates gives the expression for this 
estimate of ¢, mean as 


L2{lz + b(mi —V'1)} + C2(1 — wr*) {62} 
M,= Lz + C2(1 — pr’) > A 


where b is the regression coefficient of measurements at f, on measure- 
ments at ¢,, and r is the correlation coefficient between these measure- 
ments. Yates remarks that if the standard deviations on both ¢, and ¢, 
occasions are nearly equal (which is sometimes but not always the case 
in growth data) it is better to replace b in equation A by 1, since r is 
less subject to errors of estimation. The fact that expression A is the 
weighted combination of the two possible estimates is not immediately 
apparent when it is presented in this form, but it may be noted that 
the part inside the first curly bracket represents the regression estimate, 
and the single term in the second curly bracket ¢., the estimate from 
the added cross-sectional ¢, values. 

Equation A, then, gives us the best estimate of the population mean 
value on the second of two occasions given a mixed longitudinal series 
covering two years only. It should be used in testing hypothesis A 





under these conditions. 

Such a situation is unlikely to occur in growth studies; it is too 
restricted. As a rule there are not two occasions but nearer ten or 
more, and if we have a mixed longitudinal series given us, one would 
like, in estimating the population mean at t,, say, to take into account 
not only the values at ¢;, but also the information provided by the 
values at ¢,, ¢2, ¢;, and ¢,. This information can be worked in by 
combining numbers of independent estimates as for the simple 2-year 
case above, but is best presented in a slightly different form, as a 
recurrence equation relating the mean at ¢, to the mean at ¢,, the 
mean at ¢, to the mean at ¢,, and so on. Patterson (34) gives this 
expression, relating M,, the best estimate from all the previous years 
of the population mean at year ¢,, to M,_,, the estimate of the popu- 
lation mean of the preceding year ¢,_,, as 


M, = (1 $a) a + * (Maa —Taa)) + gata, B 


where ¢, is defined below, and where r is an overall estimate obtained 
from all the groups of (h,i—1) values in the data, and is calculated 


as 
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SX { (Una — V’n-1) (Un a In) } ; 
VSn3 (Una — Va)? V 83(th — hh)? 


Where S), indicates that the totals of the sums over all pairs of years are 
to be taken. (Details of computation are in the appendix.) This 
method of calculating r is adopted because r is, of course, merely an 
estimate of p, the population correlation coefficient, and ex hypothesi 
for the formulae of this section to be valid p does not vary from one 
pair of years to the next (see below); thus each set of paired thas 
values furnishes an estimate of p, and the more used in the estimation, 
the better. 

The value of ¢, is given by a recurrence relation in terms of ¢j_1: 











1 — fp = Ps ’ Cy-1 = 0. C 
Ni — ren( 1 wort In) 


In the special case where the same fraction of units is replaced each 
year and the number of units is constant, we have a somewhat simpler 


expression 











rv 
1 —_— —_— ° . D 
* 1 — r*(u— Adp-1) 
Under these conditions ¢, approaches, for increasing h, the limiting 
value 
dr) + Vi-FVvi— Fd — De) 
Piim — 2\r2 . E 


For any A greater than 2 the use of equation E leads to little loss of 
accuracy in estimating M),; but only for h >3 can equation E be 
safely used in estimating variance. In any case the assumption of a 
constant replacement fraction is rarely realised in growth studies, so that 
we are usually forced to use equation C. 

Equation C gives, it may be noted, 


on = 0, if Cy = 0, and 
da = 1, if In. 


In the case where Cy_; = 0, equation C is indeterminate (since ¢,-, is 
then also zero). In such a case we are forced to use the expressions, 
given in Patterson’s paper: 
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1—¢_ on a C’ 
Ni— r’C,{1 — Ly) (Mn- )/Vr-1} . 
V(M,.,) ieee r? 47 V(Mi-2) | 6 at Qo” 
Va-1 Nava Vn-2 
J (My-2) == Pes ’ Guu AX 0. qv” 
Un-2 Ch-2 


Here V(M).,) is the variance of M)_,, and can be calculated provided 
Cy» is not zero. If Cy-o—0, we can take the recurrence relation 
farther back, until we find some (), not zero. An illustration is 
provided in the appendix. Notice that if C, is zero as well as Cy., 
then ¢, = 1, and no difficulty arises (see Appendix, Example 1). Ifa 
year occurs at which there are no subjects at all, i. e., VY, = 0, then for 
the following year ¢, is calculated from equation C, using Cy, and 
¢dn-2 for Cy, and ¢y_,, and substituting r* for r’. 

All these formulae B, C, D and E assume one thing which must 
be made quite explicit. They are strictly true only if the correlation 
coefficients of the longitudinal elements remain constant for each adjacent 
pair of years, i.e. r for the ¢, and ¢,,, values equals r of the thy: thee 
values and so on. Secondly, the assumption is made that the correlation 
between values two years apart is r*, three years apart r*, and so on,— 
that is, that the currelation of ¢, and ¢,,. values equals r*, of the ¢, and 
ty.3 Values, r°. These conditions seem to be well approximated in human 
growth data except perhaps in the first postnatal year and to a smaller 
extent during adolescence. In addition Patterson has shown that a fairly 
small departure from these conditions affects his estimates to a very 
small degree only. A test of the second of these conditions is shown in 
Fig. 1. The heavy lines represent the condition when frp.,-2 = r7p.r-, 
Th.r-3 = T*p,n-1 and so on, with the first r».,_, at the four different levels 
.98, .97, .96 and .95. The various thin lines represent actual correla- 
tions from all the published material available. In the left half of the 
figure are plotted series for ages before adolescence, t, being from 9 to 
12 (i.e. the highest age being this). In the right half of the figure 
the ages include adolescence, ¢, being 16 or 17. It will be seen that 
prior to adolescence the data fit the hypothetical conditions quite 
remarkably well, but during adolescence, as is well known, r decreases 
and the thin lines depart from the condition given by the thick. So 
far as the evidence of these data goes, however, the disturbance during 


adolescence seems to be temporary; once adolescence is over, the thin 








C’ 


wr 
J 





CORRELATION COEFFICIENT 


Heavy lines represent condition for four values of Ty,a-1. 
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CORRELATION COEFFICIENT BETWEEN VALUES AT t, AND PREVIOUS YEARS ty.,ty.....thie 


Fig. 1. TEST OF CONDITION fy ,n-% = 1*n,n-1 


Light lines actual 


correlations from various data, for years before adolescence (Fig. 1A) and years 


including adolescence (Fig. 1B). 


Fig. 1A. 
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Years before adolescence: 


35 boys) bi-iliocristal, pure longt. 

20 girls( h=9. Meredith & Carl, 1946 
Stature) 275 girls, Harvard Growth 
Weight} Study, pure longt. h = 12 
Boys Stature. Brush data, mixed 
Gilet longt. h = 12. Simmons, 1944 


1B. Years including adolescence: 
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Stature) Harvard Growth Study, 275 
Weight ( girls, pure longt. h = 16. Wilson, 1935 
Girls Stature, Brush Foundation data, 
Boys} Simmons 1944, mixed longt. 

h = 16 girls, 17 boys. 
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lines seem to get back onto the thick tracks at just about where the 
condition demands they should be. (The figure, incidentally, demon- 
strates graphically the well-known fact that prediction of adult stature 
can be made better from stature before adolescence than from stature 
during it.) 

We have now got an equation for the best estimate of population 
mean at ts, say, when the values at ¢,, ts, t;, ¢, and ¢; are all taken into 
account. But the same principle can be pushed still further; we can 
obtain a still better estimate for ¢, if we also take into account the 
values at t;, ts, to, tio, and so on. Obviously if we are casting up our 
results for ¢, at a time when none of our children have reached t; we 
cannot use this extra information; it is still to come. But very often 
we do have a lot of values for ¢t; and thereafter. If our growth study 
is of the sort that has a distinct end as all its subjects reach maturity 
more or less simultaneously as in the Harvard Growth Study, we should, 
at least in theory, recalculate all our data when the end is reached, 
taking every bit of information into account for each year’s estimate. 
If our growth study, like the Fels, is built on a continuous intake of 
children, we should every so often compute completely new estimates 
for our standards, taking into account all the data available and 
modifying our previous estimates accordingly. 

Unfortunately the expression for the best estimate of the mean of ft, 
taking into account all the values for years both before and after ¢, is 
computationally complicated, and Patterson confines himself to giving 
the estimate which includes information from only a single year after 
t, in addition to all the years before it. We will denote this estimate 
by x+:M, to distinguish it from the estimate M, of equation B which 
did not take into account the values available at ¢,,,. The equation 
giving a:M, is 

soy = My — ete 


SruiOn 


{Mas — nis}, Ch a 0), F 


where M, and M,,, are obtained from equation B and its equivalent for 
My.:, and ¢, from equation C. When C, = 0 equation F no longer 
holds, and as in the case of ¢, some added complexity results. The full 
expression for y.:M) is 

Cun VO) 


Sh+1 Uh 


{Mass — Ensr}- ¥’ 


her), = M), == 7S, 


This reduces to equation F when C, +0, but otherwise must be used 
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directly, with a value for V(M),)/v, found from equations C” and C’” 
with the appropriate changes of subscripts. An illustration is provided 
in Example 2 in the Appendix. Note however that if Cy,; 0, then 
ney = My, whether or not C,—0, so that if Ca—0 it is never 
necessary to use equation F’. 


Relative efficiency of estimates of yearly mean size and mean velocity 
of growth 


We have now seen that, given a mixed longitudinal sample of 
children, there are several ways of estimating the population mean 
value at each year for a given dimension. We can use the data as 
though they were purely cross-sectional and take as our population 
value m,, the mean of all the measurements at t,. We can take into 
account values prior to ¢, and take as our estimate M), from equation B. 
We can take into account these values and the additional value at ty,1 
and use »,,./, from equation F. Finally we can in theory take into 
account an infinite number of after-h readings and estimate the mean 
as ~My. It is essential to realise exactly how these different estimates 
compare with one another. All are unbiased, that is to say, as the 
number in the sample increases, all get nearer and nearer to the true 
population mean value. But their efficiencies differ greatly, and increase 
in the order given above. That is to say, the number of cases needed 
to establish the population mean securely within given limits—perhaps 
5 per cent of the mean value—will be considerably greater if we take 
m, as our estimate than if we take M,, and M, in turn will need more 
eases for its establishment than will »,,4M,. In other words, we are 
letting a great deal of information slip through our fingers if we use 
m, (and spending a great deal of extra money for a given accuracy). 
We use more of our available information by calculating M), still more 
by calculating 4.,:4,, and the maximum possible by calculating <M). 
Patterson gives the relative efficiencies of the three estimates in relation 
to 4M, for various values of r and two values of A, and his figures are 
reproduced in Table 1. In the left-hand half of the table are the 
relative efficiencies for the estimate of the population mean ; for example, 
at the very usual growth study figures of r= .95, A = .67, the efficiency 
of m, as an estimate is only .33 or 33 per cent of that of .M,; M, is 
66 per cent efficient, ,,:M@, is 76 per cent efficient. It will be noticed 
that r affects the efficiency much more than does A, but as both get 
higher so the necessity of using M) and »,,M) becomes greater. Patterson 
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has assumed equal replacement fractions, i.e. equal A, from one year 
to the next and equal standard deviations at each year in constructing 
this table, but removal of these restrictions will not affect the figures 
very much. 

TABLE 1 


Efficiencies, relative to the most accurate possible estimate, of population yearly 
means and year-to-year increments from mixed longitudinal series 
Taken or computed from Patterson (34). 





ESTIMATES OF POPULATION ESTIMATES OF POPULATION MEAN 
MEAN AT YEAR Ah BY INCREMENT FROM YEAR h—1 
TO YEAR h BY 











r my M, nerdy m,—m,., M,—M,., Miner, M,—,-M, 
~A=1/2 
0 1.00 1.00 1.00 1.00 1.00 1.00 1.00 
.25 0.97 0.98 1.00 0.97 0.98 0.99 0.99 
50 0.87 0.93 0.99 0.85 0.93 0.96 0.97 
.60 0.80 0.90 0.98 0.76 0.90 0.94 0.96 
.70 0.71 0.86 0.97 0.65 0.86 0.92 0.95 
.80 0.60 0.80 0.94 0.50 0.80 0.90 0.94 
.90 0.44 0.72 0.87 0.30 0.72 0.90 0.93 
95 0.31 0.66 0.78 0.16 0.66 0.88 0.93 
1.00 0.00 0.50 0.50 0.00 0.50 0.00 1.00 
aA = 2/3 
0 1.00 1.00 1.00 1.00 1.00 1.00 1.00 
25 0.97 0.99 1.00 0.97 0.99 0.99 0.99 
50 0.88 0.94 0.99 0.86 0.94 0.96 0.98 
.60 0.82 0.91 0.98 0.79 0.91 0.95 0.97 
.70 0.73 0.87 0.96 0.69 0.87 0.94 0.96 
80 0.62 0.81 0.92 0.54 0.81 0.93 0.96 
.90 0.46 0.73 0.84 0.33 0.73 0.93 0.95 
95 0.33 0.66 0.76 0.19 0.66 0.90 0.95 
1.00 0.00 0.50 0.50 0.00 0.50 0.00 1.00 





These efficiencies are important in any situation in which it is 
necessary to take into account the standard error of the estimate of 
the mean; the efficiencies are, of course, inversely proportional to the 
square of the standard error, the variance. Thus to say that one 
estimate has a higher efficiency than another is to say its standard error 
is lower. The size of the standard error is important in comparing 
different samples of children one with another, in fitting curves to the 
yearly mean values, in testing hypothesis A, and in estimating the 
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velocity of growth from one year to another,—indeed in practically 
every situation likely to arise in work on growth. The variances of 
these various estimates are given by Patterson as 


V(mn) = vn/Np, G 
and for C,~0, 

V (Mi) = vndr/Cr. H 
When C, —0 we have 


V(Ma) = 





H’ 


1—r?  r?V(Mi-1) 
Ni + Va-1 , 


which can be prolonged backward until some time ¢, is found when 
C,;~0 and V(M,) can be estimated by equation H. Note that 
equation H’ holds only when C, —0. Next 


V (nMn) — 1— a ' I 


which is the most convenient form for computation but holds only 
when C,=40. The general form, which holds for C, = 0, and reduces 
to equation I when C, +0, is 





V (sla) = VMs) 1 — PCa — dry Sb 


When C, = 0, equation I’ must be used, and V(M,) computed from 
equation H’, or repeated applications of it if necessary. Lastly, 


, s*h 

J (2M) — N(2u— ¢) > : J 
expression J applying only to circumstances of equal replacement of 
units on each occasion, that is, equal » for each pair of years. 

In Table 1 are given in the right-hand half the relative efficiencies 
of four estimates of increment, also from Patterson. In presenting 
this now we are anticipating somewhat, and we will return to its 
consideration again later, but it may be as well to point out directly 
that population estimates of rate of growth can best be obtained from 
two adjacent estimates of mean yearly values as given above. Which 
estimate of mean yearly values is taken naturally affects the efficiency 
with which the increment is assessed, and since it is really increments 
we are most interested in, these figures are of prime importance and 
are perhaps our best practical guide as to whether it is really necessary 
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to compute .,M@, instead of M,, ete. The four estimates of mean 
increment are m,— m,_, (difference of overall means, data treated com- 
pletely cross-sectionally ), My, — Mrs, Mincri,2 which is the best estimate * 
available using only the present and previous occasion figures, and 
M,—1M;-:. The efficiency of the first of these estimates is hopelessly 
low, of M,— M)_, none too good, but of Mx—»Mp-, very excellent. 
This is in terms as before of the efficiency of an estimate using all 
possible information, i.e. .M,— My... The estimate M,—rM;_, is 
all we need ever compute in any foreseeable circumstances, so that in 
computing increments we may need to invoke equations B and F, B for 
the second of the two years and F for the first of the two. (Note that 
nM, is the same form as y,,\/, but for the previous year.) 


Testing hypothesis A 

In testing hypothesis A we need estimates of the population mean 
at a given year ¢, and of the population standard deviation at the same 
year. The way to estimate the mean if the series is mixed longitudinal 
is as outlined above, by means of ,,M) if t,,, data are available, by M, 
if they are not. The standard deviation is obtained quite straight- 
forwardly from the values at ¢, alone, estimated as sp. 


Hypothesis B 


Hypothesis B demands longitudinal data, if the test of it is not to 
be hopelessly insensitive. Either pure or mixed longitudinal data will 
serve, but the mixed only if used as a longitudinal series, and not as a 
cross-sectional. The simplest way to do this is to calculate each 
individual increment from ?¢, to ¢, and then average them for the 
mean, and subsequently estimate the population standard deviation 
from them directly, as has been done in Dr. Thompson’s data. The 
increment from f, to ¢, of the child P is then compared to the estimated 
mean increment of the population from which P is presumed drawn, 
and the difference between the two figures compared to the estimated 
standard deviation of the increments.‘ 


* Discussed below; see equation N. 

‘This presumes a normal or near-normal distribution of increments. Though 
this point is one of very considerable importance in several respects, the only 
data on it I can find are Boas’ (28) on height at adolescence, which seem reason- 
ably normal, and Palmer’s (4) monthly increments of weight, which are somewhat 
skewed. It is most desirable that frequency distributions of annual increments 
should be published, 
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This way of dealing with a mixed longitudinal series is equivalent 
to picking out from it the pure longitudinal data from children present 
at both ¢, and ¢, and dropping the added cross-sectional data, that is, 
those children who appeared at ¢, but not at ¢t, and those who appeared 
at ¢, and not ¢,. Only A per cent of the data are used. This is not in 
fact the most efficient way to estimate the mean increment, since the 
dropped cross-sectional data can contribute a little, though not usually 
much, more information. It is however a very convenient way of 
estimating mean increment in many circumstances, since the individual 
increments have to be tabulated to get the standard deviation of incre- 
ment for use in testing hypothesis B, whatever way of estimating the 
mean increment is used. How the best estimate of mean increment, 
using all the information, can be obtained is discussed below; but the 
added information over that provided by the longitudinal elements 
alone is very often not worth the extra labour of computing and may 
provide scarcely any greater accuracy unless the percentage longi- 
tudinality of the data is improbably low—perhaps about 40 per cent. 


The uselessness of reporting overall means and standard deviations only 

First, however, a different point. The vast majority of growth study 
data up to the present have been published only as overall yearly means 
and standard deviations. Each overall mean is a mixture of its A 
longitudinal elements and its » cross-sectional ones. But nobody is to 
know how much of the data is longitudinal and how much cross- 
sectional, since only the total numbers at each year are given. The 
invitation is plain to treat the data as if simply cross-sectional, and 
indeed many extensive studies of so-called rate of growth do precisely 
this (e.g. 33). Now what will happen to the test of our hypothesis B 
if the mixed longitudinal data are treated as though they were cross- 
sectional ? 

This is the point already examined by Shuttleworth (1) and Wilson 
(2). We want to know what will happen to our estimates both of popu- 
lation mean increment, Mincr, and of population standard deviation of 
increment Sincr. Let us deal with sincr first. Suppose we have estimated 
poulation standard deviations s, and s, at time ¢, and ¢,, and consider 
initially a pure longitudinal series. There are two ways of working out 
Siner, both algebraically equivalent and providing exactly the same answer. 
The increments can be individually tabulated and their standard devia- 
tion worked out; this is the straightforward and obvious way, but requires 
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access to the original raw data. Alternatively, we can calculate Sincr 
if we know s, and s, and r, the coefficient of correlation between the 
values at ¢, and their corresponding values at ¢, by the expression 





Siner = V 81? + 82” — 278,82. K 


This second method of calculating Since would have to be used if the 
investigator knew only the means, sigmas and correlation coefficient, 
and had no access to the raw data. 

Now the crux of the matter is this: in a pure longitudinal series r 
will be of the order of .95, while in a pure cross-sectional it will be 0. 
This is stretching the terminology of course, and it is perhaps better 
to put it as follows: a mixed series consists of a longitudinal element 
and a cross-sectional element. Let us call r for the mixed series the 
correlation coefficient for the A per cent of children who do have two 
values, in other words for the longitudinal element only; it will usually 
be, as we have already said, about .95. There remain the p» per cent 
of children, who can also be paired one at ¢, against one at ¢, if there 
are equal total numbers on the two occasions. But in these pairings, 
made at random, one value of the pair will have no relation to the 
other value, and thus the correlation for these » children will be 0, on 
the average. The whole set of increments will thus be made up of a 
proportion for which the standard deviation will be given by equation K 
with r = .95, and a proportion » for which the correlation is 0. In the 
combined set, with equal numbers on the two occasions, we shall have 





Sincere = V 8,7 a $2? — 2Ars,S>. L 


As the percentage longitudinality rises, so since drops. Also, for a 
given percentage longitudinality, as r increases, Since drops. Thus if we 
ignore the last term in equation L and treat our mixed series as though 
it were cross-sectional with A = 0, we grossly over-estimate Sincr. Just 
how grossly can easily be calculated approximately. It can be seen from 
equation L that when s, = s, this approximate mixed longitudinal esti- 
mate of sincr is 100/Y 1 — Ar% of the estimate of sincr got by treating the 
data as though they were purely cross-sectional, that is, by putting A = 0. 
If we take r = .95, we have Sincr from L as 38%, 49%, 58%, 72% and 
84% of cross-sectional Sincr for A = .9, .8, .7, .5 and .3 respectively. Even 
if r reaches its lowest values of about .65, seen only in the first year of life, 
Sincer from equation L would still be 64%, 69%, 74%, 82% and 90% 
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for the same values of A. So much for the estimation of Sincr by 
ignoring longitudinality. Is the estimation of Mincr equally poor? 
A glance back to Table 1 suffices to show that it is. In the right- 
hand block are the efficiencies of various estimates of mean incre- 
ment relative to the best possible estimate, and the estimate ignoring 
longitudinality is the left-hand column of the block, m,— mp... Its 
efficiency is pathetically bad; for A = 2/3, it is only 54% efficient at 
r= .8, 33% efficient at r—.9, and 19% efficient at r—.95. The 
point seems worth driving home that a little biometrical study is worth 
a lot of collector’s instinct; at this level of longitudinality a growth 
study which pauses to extract all its information from the data will 
give as accurate an estimate of mean rate of growth as another study 
having five times as many subjects on its books, but reporting its data 
in the time-honoured way. 

Table 1 gives efficiencies relative to the best possible estimate; we 
have said above that the estimate from the contained longitudinal 
element only of a mixed series is often nearly as good as the best 
estimate and more convenient. How, then, does the overall means 
estimate m, — mj_, compare with the estimate /, —1’,_, obtained solely 
from the longitudinal element? Consider first the isolated longitudinal 
element with means J, —I’,_,, and then add cross-sectional cases at ty 
and ¢,.,. We are adding in subjects that were otherwise unavailable 
and in general increasing the number in the sample leads to a better 
estimate of the population figure. Against this in the present case, 
however, we are losing accuracy by introducing a cross-sectional incre- 
ment with a far larger standard deviation than the longitudinal incre- 
ment, and yet treating the two elements as equally important in relation 
to their numbers by using the inefficient overall means estimate. This 
second effect overrides the effect of the total numbers being greater 
until such time as the added cross-sectional numbers are increased very 
considerably over the initial longitudinal ones. A time does come how- 
ever when our overall means estimate becomes as good as the original 
estimate founded on fewer cases, but purely longitudinal ones. 

Wilson (2) worked out the mathematics of this and found that the 
time came when the ratio 


Children present at ¢, but absent at ta _ 2r—1 
Children present at ¢, and at ¢, = 1—r 





given the simplifying assumptions of equal total numbers and equal 
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standard deviations at ¢, and ¢.. In the terminology used above this 
condition is 


#., 9*r—! 
A™ ja -’ 
ia. aes. M 


I have put down in Table 2 the values for A corresponding to various 
values of r which give equality in expression M. These values for A 
are those below which the percentage longitudinality of a mixed series 
has to fall before treating the values cross-sectionally gives as good an 


TABLE 2 


Values of minimal percentage longitudinality \ in a mixed series for which 
estimation of mean increment is better from longitudinal values only 
than by treating overall values cross-sectionally 


A given by \ = (1 —r)/r where r is correlation coefficient of longitudinal element. 








r r 
.98 .02 
.96 .04 
94 .06 
92 .09 
.90 ll 
85 18 
80 25 
75 33 
.70 43 
65 54 
.60 67 





estimate of mean increment as does dropping the cross-sectional element 
completely. Suppose a given series has a longitudinal element whose 
correlation is .90; by reference to the table we find that dropping the 
cross-sectional element is more efficient unless the series is under 11 per 
cent longitudinal, a most unusual occurrence. Only when the correla- 
tion gets very low does the longitudinality have to be relatively high 
before cross-sectional reporting becomes the more efficient of the two 
methods. 

An example will show just how badly the overall cross-sectional 
estimation of mean increments and standard deviations serves us. We 
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will test our hypothesis B first in the incorrect cross-sectional manner 
and then taking our estimations from the pure longitudinal element 
only. The Brush Foundation data are a convenient source. In the 
monograph devoted to reporting the data (5) only overall means and 
standard deviations for each year are given and we cannot disentangle 
cross-sectional and longitudinal elements. But in a previous publication 
dealing with heights and weights only (6) this usual method is supple- 
mented by tables of actual yearly increments for each t,_, to t, longi- 
tudinal element, correlation coefficients for the longitudinal elements 
and tables of percentage longitudinality. (This is nearly, but still not 
quite, all that is needed to make the best possible test of hypothesis B; 
the authors omit to give the means for each year for the longitudinal 
and cross-sectional elements separately, i.e. 1, and ¢, and these also are 
required.) We will take the male stature increment from 6 to 7 years. 
The overall mean tables show there were 113 boys at age 6 with mean 
117.24 cms. and 104 boys at age 7 with mean 123.89cms. ‘The mean 
increment by subtracting one of these figures from the other would 
thus be 6.65ems. The 84 boys who constitute the longitudinal element 
however had a mean increment of 6.41 cms. and since r—0.97 and 
\ = 0.74, this is a much better estimate of the mean rate of growth 
from 6 to 7 of the population of boys sampled in this study. The best 
estimate, from equation F, cannot be made. 

Fig. 2 illustrates this difference between the overall m, — mp_, esti- 
mates and the longitudinal /, —I’,_, estimates, over the whole growth 
span, for stature and weight in boys and girls. The solid line represents 
the longitudinal element estimate 1, — 1’,_, and the dotted line the overall 
cross-sectional estimate. I have labelled the curves “ true ” and “ false ” ; 
both are unbiased, but the “true ” curve is much more precise, that is, 
its points are less subject to sampling error. The greater smoothness 
of the “true” increment curves is apparent in each graph, and in the 
top two figures I have indicated the + 2 standard errors of the actual 
mean increments at age 11-12. The “false” increment figure falls 
outside even these limits. One can without difficulty imagine a pre- 
adolescent slowing of weight velocity being postulated from the dips in 
the “ false ” increment weight curves at 9-12 and 9-10, and a 7-9 spurt 
in female stature appearing from the lower left-hand graph. The 
“false ” curves obscure the true course of events by their high standard 
errors, and may mislead the unwary into unwarranted conclusions. 

Now as to the estimation of Sincr. Presented only with overall yearly 
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means, we would be in something of a quandary what to use as Siner, but I 
suppose we carried on the same way as for means and calculated sinc } 
as for cross-sectional data by the usual method as Vs,?+ 8.2. This ] 
figure for the Brush male stature data is 6.56cms. Actually, however, ‘ 
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t 
the Sincr is far less than this, r being .97. From the longitudinal element t 
of 84 boys the estimate of Sincr is 1.29 cms., about a fifth of the previous, foi 
wrongly-made estimate. Suppose a physician was presented with a boy 
believed drawn from the same population as the Brush Foundation 1 
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sample, who had grown just 2.0cms. from his sixth to his seventh 
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birthday. If he knew the mean and standard deviation of the 6 to 7 
year increment as estimated from the longitudinal element of the study, 
he would calculate the boy was (2.0 — 6.41) /1.29 = —1.93 standard 
deviation units down from the mean of the population growth. If he 
took the hypotheses of the child being abnormal, either small or large, 
he would be very suspicious of abnormality, since the conventional test 
would demand a figure of + 1.96; if he took the hypothesis of the child 
being too short, he would be practically certain he was confronted with 
an abnormality since the test demands a value of only — 1.65, and he 
would proceed accordingly. But if he did not have these estimates to 
go by, but only those from the overall cross-sectional figures (which is all 
in practice he has got for almost any standards, such is the situation 
at present) he would calculate the boy was (2.0 — 6.65) /6.56 = — 0.71 
standard deviation units below the mean and (unless gifted with common 
sense) would pay no further attention to the matter. Had equation K 
been used with r= .97 as though the whole series was longitudinal, 
Siner WOuld have been 1.27 cm., or using equation L with 1—A, Sincr 
would have been 3.51 cm. 

In short, the inefficiency of our method of reporting and using the 
standards has completely vitiated the purpose of their collection. It is 
clear that data reported in the overall mean and standard deviation 
manner cannot be used as they stand either by the pediatrician for 
comparing a given child’s increment with that of a group, or by the 
biologist interested in rate of growth. 


The best estimate of mean increment 


We have already pointed out in passing that the estimate of mean 
increment from the pure longitudinal element alone, though better than 
that from overall means, is not the best possible estimate; it wastes the 
admittedly small extra amount of information provided by the thrown- 
away cross-sectional cases. Just as both longitudinal and cross-sec- 
tional elements can independently furnish estimates of the ¢t, mean 
(see above) so they can furnish independent estimates of the ty, 
to t, increment. Each estimate is given its correct weighting and the 
two combined to give the single best estimate. The formula for the 
increment from ¢,_, to ¢t, taking into account only t,_, and ¢, values 
(and not those prior to ¢,_, or after ¢,) was given by Wilson; it is the 
increment equivalent of expression A for the estimate of M, above. We 
will call this estimate of increment Mincr:,2; it is calculated as 
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The terms A and B are weights attached respectively to the longitudinal 
element increment and the cross-sectional increment. Under the 
simplifying conditions of equal numbers and equal variances on both 
oceasions, expression N reduces to 


A 7 a 1— - 4 rv 
ae ee. Sols ee ae N 


M incri1.2 = ——— 
a 1—pr 1—~wpr 


where the relative weightings are perhaps more clearly seen. | 
Equation N is the appropriate expression for computing mean incre- } 
ment when there are only two years in the series; there are other circum- 
stances, to be discussed in a minute, when it may also be very appropriate, 
But, what we need now, at least theoretically, is a formula for the ty, 
to t, increment which takes into account all the increments before fy, 
as did the formula B for M,, and one which takes into account all 
increments after ¢, also, comparable to ~M,. Such expressions are best 
obtained simply by estimating the means at ¢,_, and ¢, as detailed in 
the previous section, and subtracting one mean from another to get the 
estimate of mean increment. Thus the best possible estimate of incre- 
ment from ¢,_, to ¢t,, taking into account all data before and after t, 
would be ~M, —.My-,. Calculation of this is excessively laborious, and 
unnecessary because of the very high efficiency of the estimate 
M;, —»M,_.,. This is shown in the right-hand column of the right-hand 
block of Table 1, from Patterson. These figures hold strictly only for 





the conditions of equal replacement fraction (equal A throughout) and 
equal standard deviation at each year, but they will not be greatly altered | 
by removal of these restrictions. It will be seen (perhaps with relief) 
that even for high values of r and low values of A this M,—,M,., 
estimate is over 90 per cent efficient, and there seems no reason to use 
any more complicated estimate in any foreseeable circumstances in 
human growth data. Though this estimate can be expressed in terms 
of the M, series only, it is computationally simpler to work out the,M,_, 
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series also by equation F, and obtain M, — ,M,-_, by simple subtraction ; 
and in any case it is desirable to have the extra 10 per cent or so efficiency 
provided by »My-, in the estimate of mean yearly values. 

The other three columns in the right-hand block of Table 1 give 
the efficiencies of other estimates of mean increment. The overall 
means increment m,— my ,., we have already discussed. The estimate 
M, — M,-, is rather surprisingly a good deal less efficient for commonly 
met values of A and r than M; — »M)_, and so the latter is greatly to be 
preferred. On the other hand the estimate from the present and previous 
occasions only, Mincri,2, 18 surprisingly good, and though not quite as 
efficient as M,—,M)_, does approach it fairly closely. Most growth 
studies have most of their A fractions above 14 and their correlation 
coefficients above .95, and in such circumstances Mincri,2, is not greatly 
under 90 per cent efficient. 

The variances of Mincr1,2 and of M, —,M)_, (which would be needed 
in a comparison, for example, of boys’ rate of growth with girls’) have 
been given by Wilson and by Patterson respectively. 


AB 
A+ 8’ 
where A and B have been computed in evaluating Miner1,2 ; 


V(My—2Ma1) — V(Ma) + V(aMa1) —2cov(Ma,aMa.) P 


V (Miner 1,2) — O 


= V(Ma) + V(aMa-1) —tr>-(! —da)V(Ma.). P 
a | 


Expression P’ is the most convenient if the variances of the ,M)_, series 
have been calculated. It holds also for Cy_, 0, when V(,My_,) must 
be obtained from equation I and V(M,_,) from equation H’. When 
Cy. <0 and the variances of the M, series only have been calculated, 
the most convenient form is 


Tih — din nee + V(Mn1) 
x rok 7, on-1(1— bn) — 2r-* (1—dn)) » 


or directly from the basic statistics by 
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It would be very desirable to put in the mean increment curves 
estimated by My, —»Mp. and Mincri,2 in Fig. 2, but unfortunately the 
absence of figures for 7, from the paper cited makes this impossible, 
The considerations below, however, indicate that such curves would 
coincide very closely with the pure longitudinal ones. 


Estimation of increment from longitudinal element only 

This fact introduces the last important point in this section. 
The addition of the cross-sectional data estimate gives so little extra 
information under most circumstances as to be scarcely worth the 


TABLE 3 


Variance of estimate of mean t,., to t, increment from pure longitudinal element 
of miaved series only, in percentage of variance of best estimate, 
by equation N’, from expression 100(1— ur) /d 











r r= .95 r= .90 r= .80 r= .70 r = .60 
95 100.0 100.4 101.0 101.7 102.2 
9 100.4 101.3 102.2 103.5 104.4 
8 101.0 102.4 105.0 107.5 110.1 
7 102.0 104.2 108.6 112.6 117.0 
6 103.3 106.9 113.4 119.8 126.8 
5 105.0 110.3 119.8 130.0 139.9 
4 107.1 114.9 130.0 145.0 161.8 
3 111.8 123.2 146.9 170.1 193.5 
2 119.8 139.9 180.1 219.9 261.2 
1 145.0 210.6 279.9 370.2 460.1 





trouble of computing. This can be seen most easily in the somewhat 
restricted circumstances represented by equation N’. The relative 
weights of longitudinal and cross-sectional estimates in this ¢, to ¢, 
case are A/1—ypr and p(1—r)/l1—~pr. For r=0.95 these weights 
work out at longitudinal to cross-sectional 199 to 1 for A= .9, 82 to 1 
for A= .8, 30 to 1 when A=.6. A more compact way of recording the 
same thing is shown in Table 3. The variance of the estimate Mincri,2 
of equation N’, under the conditions for which the estimate is fully 
efficient, is 
2s2(1—r) 1 

N ; 1— pr 


Q 





V (M ner1,2) = 


(which can be obtained from equation P by imposing the restriction of 
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equal numbers and equal variances). The variance of the pure longi- 
tudinal series contained within this mixed one would be 2s?(1—r)/NA 
if we ignore the small correction in s* imposed by the fewer numbers, 
and is thus very nearly (1—ypr)/A times the variance of the best esti- 
mate given in expression Q. For any usual value of r, this fraction is 
very near unity; it is set down for varying values of r and A in Table 3. 
The variance of the estimate of mean increment obtained, for example, 
by dropping the cross-sectional element of a series with r—.95 and 
A} = .80 is just 101.0 per cent of the variance of the best estimate of 
mean increment. The gain in efficiency by using the best estimate is 
quite trivial for values of A down to 0.4 or so when r is .90 or above, 
and down to A= 0.7 for r—.70 even. 

Admittedly this test applies only to the equal replacement, equal 
standard deviation conditions and to the estimates of increment from 
the present (¢,) and the previous (t,_,) occasion only. We have already 
seen, however, that under these conditions the present-previous estimate 
of increment Mincri,2 is 90 per cent or more efficient, and the removal 
of the equal standard deviation equal replacement conditions will not 
greatly affect this. Thus while Table 3 may overestimate the relative 
efficiency of the dropping-the-cross-sectional data scheme for the un- 
restricted case, it does not do so to any great extent—probably not more 
than 10 per cent under most foreseeable circumstances. In addition to 
this, both the longitudinal-element-only estimate of increment J, —1’y-, 
and the present-previous estimate of increment Mineri,2 have one advan- 
tage (as Patterson points out for the latter) over the best estimate. They 
are not affected by the pattern of the year-to-year correlations specified 
above (and in Fig. 1) of r constant from one year to the next and of 
Thr-2 =T*nn1, ete. Departures from this condition do occur to some 
extent in human growth data at the early ages and during adolescence, 
and they will tend to make the variance given in expression P somewhat 
too small, and therefore possibly slightly misleading; the effect, though 
existent, is not a large one. In circumstances where this pattern of 
correlations does not quite hold, therefore, the longitudinal-element-only 
increment and the Mincri,2 estimate may actually be theoretically as well 
as computationally preferable to My, — Mj. 

The variance of the longitudinal-element only increment is given by 


V (1, —Vna) = (V(t) + V (Un) —2 cov (Ip, U'n-1)}/La. R 


At this point the reader may be referred to the two examples in the 
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appendix, where, in Tables 5 and 7, he will find tabulated the practical 
tests of these theoretical findings. In particular the estimates of change 
1, —l’,_. and M,—»My. should be compared (they will be found in 
the last and last-but-one blocks of these tables). The two estimates 
differ very slightly, as predicted ; this is seen better in Example 2, which 
is more extensive than Example 1, and does not have a large number of 
C,—=0 as Example 1 does. The average difference is about .02 to 
.03 ems. for half-yearly gains, representing about 3 per cent of the 
average half-yearly increment. The difference is on the average about 
one-third of the standard error of the M,—,M,_, estimate of mean 
increment; in only one instance out of the 32 calculated does it exceed 
one standard error, and then only just. The variances of these two 
estimates of increment are about the same, as predicted, neither being 
clearly systematically lower than the other. In Table 5, last block, the 
estimate of V(Mincri,2) is given for two increments; it differs imper- 
ceptibly from V (7, —/’,_.), as predicted. 


Testing hypothesis B 

In summary of this section, then, we may say there are two good 
ways of estimating the mean increment from one year to the next. The 
theoretically better way under most circumstances is by means of 
M, — Mp, and this has a variance given by expression P. A simpler 
way, in most cases nearly as accurate if the percentage longitudinality 
is fairly high, is by dropping the cross-sectional cases completely and 
taking the estimate as the mean of the longitudinal element increment, 
as has been done in Dr. Thompson’s paper. Under circumstances where 
the specified pattern of correlations from year to year does not hold, 
this estimate may indeed be theoretically preferable to the M, — ,My_, 
estimate. In circumstances where only an estimate of t,_, to ¢t, incre- 
ment is required, without an estimate of mean yearly value at ¢, and 
ty-1, the longitudinal-element-only estimate is the one of choice except 
in extraordinarily accurate work. since it is much faster to compute from 
the raw data than M,—,M,_,;. But if WM, and ,My_, are themselves 
required there is little computational advantage on one side or the other. 

The standard deviation of increment should be estimated from the 
actual ¢,_, to ¢, increments, i.e. from the longitudinal element alone. 
The deviation of a given child’s increment from the estimated popu- 
lation mean is then compared to this estimate of standard deviation, 
and an opinion given as to whether or not the child comes from the 














REPORTING OF GROWTH DATA 121 


population the standards are based upon; if not, the child may well 
be abnormal. 


Hypothesis C 


There is thirdly hypothesis C, dealing with the child P’s acceleration. 
Little need be said about this in principle, since the acceleration data 
bear the same relation to the velocity data as the velocity do to simple 
overall yearly figures. For any given acceleration from ¢, to ts one 
would have a longitudinal element of children present throughout the ¢, 
to t, period and throughout the ¢, to ¢t; period, and a cross-sectional 
element of those present at ¢, and ¢, and absent at ts, ete. From each 
of these elements one could get independent estimations. Note that 
this time there is yet a third element we might call distance-cross- 
sectional as opposed to velocity-cross-sectional. This third element con- 
sists of children present on only one of the three occasions ¢,, t, and fs. 
This element also can furnish an estimate of mean acceleration, though 
a poor one compared to the other two. 

The weighted combination of these three estimates which gives the 
single best estimate has not been published though it could be worked 
out by the mathematical method Patterson uses for velocity estimates ; 
no new principle is involved. In practice, however, considerable doubt 
remains as to how best to estimate the mean acceleration; at the moment 
acceleration is claiming little attention in the study of growth, but this 
will not always be the case. We could perhaps proceed as in velocity 
estimation and drop all the cross-sectional figures, both velocity-cross- 
sectional and distance-cross-sectional, and use the pure longitudinal 
series alone. However, it is probable that the estimate from velocity- 
cross-sectional data is not as greatly inferior to the longitudinal 
element estimate as it was in velocity data. It will be recalled that 
the relative importance of the estimate depends on the correlation 
coefficient for the longitudinal element, uniformly high in velocity data. 
The only incremental correlations (i.e. of increment ¢t,—#, with 
increment ¢; —?,) I can find are for the Harvard Growth Study data 
reported by Muhsam (7) and they are uniformly low; they range from 
zero to 0.6 for stature increments over the age span 7-15, and are 
practically uniformly zero over the same age span for weight. 

Thus it seems that dropping the children not present on all three 
occasions will not lead to a very good estimate of acceleration. 
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The relative value of the hypotheses 


The reporting of the data of a growth study relates to the hypothesis 
the study was designed to test. When an investigator embarks on a 
longitudinal study, he thereby declares that there is more value in 
hypotheses B and C than in A. For most purposes this seems a very 
reasonable belief. Growth, after all, is a form of motion, and motion 
is usually better characterised by velocity and acceleration than by end- 
results. The morphological events, too, only reflect biochemical pro- 
cesses, and if it is these that we wish to penetrate to, rate of growth 
seems a more appropriate characteristic to consider than size. Even 
from the pediatrician’s point of view, it seems that one is more likely 
to interrupt abnormality early and successfully by studying velocities 
whenever serial examination makes it possible, rather than by measuring 
their end-result. 

There is one further point that should be made explicit. We have 
been considering average yearly increments, and assuming these have 
some sort of validity as measures of velocity of growth. This they 
certainly have, but it must be remembered that when a measurement is 
rapidly decelerating during the year the average velocity over the whole 
year is not actually occurring at the mid-point of the year, but before it. 
The smaller the increment periods taken the better, as the nearer the 
calculated velocities then approach to the true curve, when the ripples 
due to seasonal differences have been ironed out. If the intervals are 
coarse, interpolations can be made by the methods described for example, 
in Yule and Kendall (8), or continuous curves can be fitted. There 
seems little point in fitting curves, however, especially to average incre- 
ment figures, unless one has a definite hypothesis to be tested by such 
a procedure, and unless the sampling errors of the curves’ constants can 
be calculated sufficiently well for the test of the hypothesis to be carried 
out. This point has been very clearly discussed by Sholl (9). Whether 
halfyearly periods should be used instead of yearly ones during the first 
few years depends partly on the data and partly perhaps, when the 
methods described above are used, on the pattern of the year-to-year and 
halfyear-to-halfyear correlations. If the latter fit the assumptions of 
the system better than yearly correlations, halfyears should perhaps be 


preferred. 
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Hypothesis D 
Pure longitudinal series covering birth to maturity 


So far we have considered only mean yearly values, mean year-to- 
year increments and mean accelerations. We have discussed how to 
make a test of whether a given child is of normal size at a particular 
year, or if he has grown normally from one particular year to the nect. 
In testing these hypotheses we invoked only the necessity of having data 
for a given child which extended over two, or for acceleration, three 
years. It is true that our estimates of mean velocity will be slightly 
improved by having larger stretches of data on single children, but the 
improvement is really a trivial one. The question arises as to what 
advantage, if any, is gained by collecting longitudinal data that cover 
a span of over two years. Admittedly there are obvious and manifold 
reasons in the physiological, psychological and sociological fields for 
prolonging the observation of the particular child, but we are not 
immediately concerned with these. We want to know if we can derive 
additional information as to morphological changes from prolonged 
individual data, and if so, how such data should be handled so that we 
actually obtain this information. 

Despite the number of longitudinal studies completed or in existence, 
relatively few serious attempts have been made to use their individual 
longitudinality. The difference of extended longitudinal studies over 
two-year ones is that a whole series of points are available for each child 
in the former, and these points can be characterised by some form of 
mathematical expression. A curve can be fitted to the points; and in the 
equation of this curve there will appear several parameters. The values 
of these parameters for each individual give the further information 
that we are seeking; the parameters give more than the simple velocity 
or acceleration at a particular age. Our previous hypothesis was that 
P’s increment over a single year-was normal; from 6 to 7 or from 7 to 8. 
We are substituting the more sensitive hypothesis D—that P’s growth 
over a number of years, preferably from birth to maturity, is normal, 
as judged by the standards of his group. To test this hypothesis we need 
to fit curves to the series of points generated by single individuals and 
to do this usefully we need a good hypothesis about the type of curve, 
or group of curves, to be fitted, and a considerable number of points. 
Equations can be fitted to an individual’s distance, velocity, or accelera- 
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tion curves, but the use of the distance curve seems preferable as its 
initial points are determined with only one measuring error, whereas 
velocity involves two. The fitted distance curves can be differentiated 
to give velocity and acceleration. 

Suppose that the curve used is of a form which has three constants 
a, b, and c. For a given population there will be many such curves 
and thus a series of a’s, b’s, and c’s, whose means and variabilities can 
be calculated. The individual P’s figures for a, b, and ¢ can be then 
compared with the figures for the group. The procedure is arith- 
metically laborious, of course, but likely to produce very interesting 
results. Though innumerable curves of growth have been proposed, and 
considerable numbers of individual growth curves amassed, there does 
not seem to have been any attempt to carry through this whole approach 
on a practically useful scale. Merrell (26) in 1931 fitted individual 
weight curves for 46 rabbits, but only as an example in her mathematical 
treatment of the problem, which we will return to shortly. Jenss and 
Bayley (10) in 1937 emphasised the method and introduced an equation 
now being studied practically by Dr. Jean Deming (11). And in the 
course of a discussion on the growth of skull dimensions in early child- 
hood, Count (13) has given a more complete example, in which a single 
individual’s parameters are compared to the mean values for the group. 

One of the things the method does is to eliminate time differences; 
that is, it allows for one child being ahead of another. Suppose, for 
example, we have two children who have adolescent spurts identical in 
form and intensity, but beginning at different times. If our curve is a 
well considered one, one or more of its parameters will characterise the 
slope of the spurt, another its peak intensity, and so on. These para- 
meters will be equal for our two children, for they are unrelated to time. 
Two other figures, referring to the time at which the change in velocity 
began, will be different for the two children, and will characterise their 
chronological advancement or retardation. Such parameters are the 
appropriate way of investigating the exact relationship between the time 
of the beginning of the spurt and the peak velocity achieved, and similar 
questions. In fact, the outstanding use of the method is Shuttleworth’s 
investigation on adolescence (1), dealing with this precise problem, 
amongst others. 

Fig. 3 may help to make this clear. The two graphs are diagrammatic 


and drawn after the manner of Shuttleworth. The five solid curves in 
Fig. 3A are individual velocity curves plotted against time. In Fig. 3B 
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the same curves have been arranged so that their points of maximum 
velocity coincide. In each graph the dotted line represents the average 
curve of the five. It is obvious that this dotted curve in Fig. 3A is a 
very poor characteriser of average velocity ; it smooths out the adolescent 
spurt, as pointed out some time ago by Boas (29) and Davenport (12). 























90 + 
INDIVIDUAL CURVES 
80 MEAN CURVE eecewccon 
70 70 
Fe 60, #0 
8 > 
rv) 
S sop oso 
>, my 
ee > 
a w 
2 40} £ 40) 
< - 
& < 
wo - 
FT.) & * 30} 
20} 20 
1OL 10 
i i i 1 1 L i | L i i i i Si i i i i. i i i i i i 
$6769 10 4 12 13 4 tS 67 “6 -S -4 -3 -2 -i MAX' 2 3 45 6 
VEL. 
AGE YEARS YEARS BEFORE ANDO AFTER TIME OF MAXIMUM VELOCITY 


Fic. 3. THe RELATION BETWEEN INDIVIDUAL AND MEAN VELOCITIES DURING 
THE ADOLESCENT SPURT, WHEN THE INDIVIDUAL CURVES ARE CLASSIFIED BY AGE 
(Fic. 3A) AND BY THEIR ACCELERATION (Fic. 3B). AFTER SHUTTLEWORTH, 1937. 
Fig. 3A. Curves plotted against age; Fig. 3B. Same curves plotted with time of 

maximum velocity coinciding for each. 


If individual P’s spurt were compared to such an average, he would be 
thought quite abnormal. But in Fig. 3B, the average curve characterises 
the group quite nicely. This is Shuttleworth’s graphical solution of 
the problem, a solution which proved exceedingly effective in his hands, 
but which is at best only semi-quantitative. The common practices of 
equating children for bone maturity age, or years before or after 
menarche, or growth of secondary sex characters are likewise attempts, 
more or less successful and only applicable over a short range of the 
growth period, to deal with this problem. 

This brings us to the last point of this section. Besides fitting curves 
to an individual’s points, one can fit a curve with the same type of 
equation, very often, to the mean values of a group, either of a cross- 
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sectional or of a longitudinal series. This mean curve, as we call it, 
will have certain values for a, b and c. But these mean-curve values 
will not, in general, be the same as the mean values of a, b and c obtained 
by fitting individual curves for persons X, Y and Z and averaging the 
ax, dy and az, the by, by and bz, and the cx, cy and cz. On the con- 
trary, we can plot another curve which does have for its constants these 
average-of-individual values, and this curve is called the mean-constant 
curve, as opposed to the mean curve. In Fig. 3A the average curve is 
the mean curve of the five individual values at each age; in Fig. 3B the 
average curve approaches a mean-constant curve. 

Merrell (26) has examined mathematically the relation between the 
mean curve and the mean-constant curve, to see how close a mean curve 
‘an get to the mean-constant curve, the one we really want. She found 
that it was possible to express the mean curve as the mean-constant 
curve plus a series of added terms. These added terms constituted an 
algebraic series, the first member of which involved the second order 
partial derivatives of the initial equation with regard to the constants 
a, b, c, ete. The second term of the series involved the third partial 
derivatives and soon. Thus the amount by which the mean curve differs 
from the mean-constant curve depends on the initial equation used, and 
when we have individual data available it can be evaluated. Merrell 
gives the examples of a parabola, where the mean curve and mean- 
constant curve are identical, and a logistic, where they differ considerably. 

This result is of very considerable importance, and we will pause 
for a moment to consider further examples of its application. In 
Count’s paper on skull growth already referred to (13), the average 
curve is given as a Mean curve, not a mean-constant one; Count did not 
calculate each individual’s a, b and c at all. He compares one particular 
individual’s a, b and c however with the a, 6 and ¢ of his mean curve. 
He implicitly assumes, then, that his mean curve is identical with 
his mean-constant curve. We can now judge whether he is justified 
in this assumption or not. The equation he fits to the points is 
y=a+bt+clogt. The first partial derivatives of this with regard 
to first a, then b, then c, are 1, ¢, and logt. The second are all zero. 
Consequently his mean and mean-constant curves do coincide. A little 


farther on in the same paper, he goes through exactly the same process 
using a dimension y plotted not against time ¢, but against another 
dimension z. This time his fitted equation is y= ae", and this time 
the second partial derivatives are not zero. The mean curve he gives 
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is not identical with the mean-constant curve and the comparison of an 
individual’s figures for a and b with those of the group curve involves 
a fallacy, the numerical importance of which is unknown. 

Count’s pioneering work in the mathematics of human growth curves 
provides us with another example, this time of a slightly different nature. 
Count has fitted a series of three curves to the cross-sectional birth-to- 
maturity data of stature (14). This mean curve has various charac- 
teristics, in particular three periods of acceleration,—prenatal, mid- 
growth and adolescent. Which characteristics of this curve characterise 
also the mean-constant curve, the one we are really interested in, and 
which came only from the sort of averaging process pointed out by 
Davenport? Merrell’s work provides at least a partial answer. The first 
two of Count’s curves are logarithmic, of the form y = a- bt + ¢ log ft, 
and so no distortion from the mean-constant curve appears. From the 
beginning of adolescence onwards, however, the fitted curve is a logistic, 
the form Merrell examined closely. Her studies indicate that this mean 
curve of Count’s will probably not produce any undulations that are not 
present in the curve of individuals, but it will misrepresent the degree 
of skewness of individual curves. Count’s logistic for females is asym- 
metrical ; Count’s male mean curve is symmetrical and in this case the 
mean-constant curve is likely to be asymmetrical. The magnitude of 
this distorting effect of mean curves depends chiefly on the standard 
deviations of a, 6 and c, which remain unknown for this type of data. 


PART TWO: TWO OR MORE MEASUREMENTS; PROPORTIONAL GROWTH 


The second part of this paper is concerned with proportional or 
relative growth, and the relating of two or more measurements 
throughout the growing period. As a rule this has been done by plotting 
the ratio of two measurements as a curve in time; if the line rises, it is 
said that the upper member of the pair is growing faster than the lower 
member. And if the line obviously curls upwards, that is. has an 
increasing positive slope, it may be assumed that not only is the upper 
member growing absolutely faster than the lower but that it is growing 
faster and faster still, that it is accelerating away from the lower member. 
The assumption sounds reasonable, but it is an incorrect one. Ratios 
can rather easily involve elementary statistical fallacies where their 
user is unwary; some of these I have examined elsewhere (15). We 
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must consider now what is the appropriate way of reporting data for 
the various hypotheses of relative growth which one might wish to test. 

Let us call the two dimensions / and m. As in the first part of 
this paper, there are three simple hypotheses we might be concerned 
with. Hypothesis A is that 1 > m at a particular age; cross-sectional 
data for the age in question obviously answers this. The hypothesis 
might further state that 1] > m by an amount which increases as time 
passes. On this hypothesis, then, //m > 1 and increases as ¢ increases, 
or (1—m) > 0 and increases as ¢ increases. The relevant test is to 
plot 1/m or (l1—m) against ¢. We then want to know if, say 1,/mz is 
greater than /,/m,, the subscripts referring to consecutive years. The 
procedure is the usual one of relating the differences between the two 
ratios to its standard error. Longitudinal data will be much more 
efficient than cross-sectional for testing the hypothesis because the 
standard error of this difference will then be much smaller for the 
reasons we have already discussed. We are comparing the velocities of 
the two dimensions and testing (albeit with bias, as discussed below) 
whether, from the period ¢, to ¢., 1 has a higher velocity than m. 
However, we are still essentially concerned with changes in form, not 
with changes in relative rate of growth. 

Hypothesis B is that the rates of growth of the dimensions 1 and m 
are changing, one relative to the other. The hypothesis is that the 
velocity of / is greater than that of m, and is increasing relative to m’s 
velocity as time passes. If it is true that biochemical processes are best 
searched for by considering rates of growth rather than size attained, 
then it is this hypothesis which has the most importance for us. If the 
fundamental control is of velocity, we must look for changes of velocity 
of J in relation to m; so long as the ratio of their velocity stays constant, 
we need not expect new chemical processes to be entering the picture, 
even if the external form is changing. The relevant function to plot 
is the ratio of the velocities, dt/dt/dm/dt, or using a dot above the letter 
to signify differentiation with regard to time, i/m. But I cannot find 
a graph of this function anywhere in the literature of human growth. 
Longitudinal data, of course, are needed, and the methods used for 


obtaining estimates of J;, and m2, 123 and t.3 etc. are those described 
in Part 1 of this paper. 
This notation makes it easy to see why we cannot in general arrive 
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at relative changes in velocity from looking at the slope of the [/m 


_ afl 
curve. The slope of the //m curve is 77 \ 7 and 


m 


dfl i lm 

dt (<) mm" 
The slope of the //m curve depends on the ratio of the two dimensions 
as well as on their relative velocities. This is illustrated by two 
examples in Fig. 4, where the solid lines represent 1/7 and the dotted 
lines the slope of 1/m, the raw data being the same in each instance. 
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Fig. 4. COMPARISON OF CURVES OF I/m, THE RATIO OF TWO VELOCITIES, WITH 
THE SLope oF [/m. BrusH FouNDATION Data, Stmmons, 1944. 


Solid line I/r, dotted line slope of U/m. 


No vast discrepancy occurs in these two cases, but the slope of 1/m 
fluctuates considerably more than i/m, and in some places the two 
plainly do not tell the same story. For example, in the male curve of 
sitting height/stature (upper left) the slope of //m increases more or 
less uninterruptedly from 2 to 15 years. But the ratio of the velocities, 
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I/m, is constant, or even slightly decreasing, from age 2 till adolescence. 
On the other hand, “(l—m) does equal (i— m) ; if the absolute 


difference between the measurements is plotted, its slope does give the 
absolute difference of the velocities. 

Hypothesis C is that 1 is accelerating more than m, and that the 
degree to which the accelerations differ increases with time. In this 
case I/m or (1— im) must be plotted. 


The function used to represent the relation between two dimensions 

The function of J] and m which would best represent the relation 
between the two dimensions seems never to have been discussed in 
relation to growth data. For a long time it has been tacitly assumed 
that the ratio 1/m is a good one, but the assumption seems to me 
to be very insecurely grounded. Why not (/—m), or (l?’—m?), or 
(1—1)?— (m— m)?, (log 1— log m) or logl/log m? Can we adduce 
any principle to guide our choice? The question applies equally, of 
course, to the relations between the velocities / and mr. (Allometry, dis- 
cussed below, is an example of using a more complicated function of 
these, notably i/1/rin/m .) 

The situation has perhaps some slight analogy with two statistical 
procedures, the analysis of covariance and Studentization. The latter 
term is applied to the finding of a statistic relating, for example, two 
variables such that scale of measurement or absolute magnitude of the 
variables does not affect the statistic. The process of finding such 
statistics is of great importance in statistical estimation, and perhaps 
may be the criterion for our choice in the present situation. Presumably 
it was the desire to obtain a combined measurement independent of the 
scale which led to the adoption of the ratio 1/m instead of the difference 
(l1—m). If is a dimension with a mean of 50cm. and m a dimension 
with a mean of 10cm., it would appear at first glance that (1 —m) 
would be dominated by the value of / at the expense of m,—that (1 — m) 
would be more dependent on / than on m. But that this is not always 
so can easily be seen; suppose / were constant, then (1—m) would 
depend more, and indeed solely, on m. Evidently the relative vari- 
abilities of 7 and m enter into the matter. Indeed, in general the 
correlation of (1—m) and 1 is 


Ti- mJ = (s; —— SmT im) /Sit-m 
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where $1, Sm, aNd Sim are the standard deviations of J, m, and (l1—~m), 
and rim is the correlation coefficient between | and m. The correlation 
of ([—m) and m is similarly 


Tl-m,m = (SiTim aa Sm) /St-m . 


If (l1—m) is to be equally closely correlated with 1] and with m 
(naturally, with opposite sign) it is easy to see that we must have 
either 
Si = Sm 
or 
Tim = 1. 


A similar argument can be carried through for the ratio 1/m. The 
correlation of 1/m and I is approximately (see reference 15 for more 
detail ) 

Vi— VenFim 


V Vi? + Vi? —2rinViVn 





Ti/m,t = 





and of 1/m and m 
Vitim oe Faite! Vas 
7 >. 3 ; of 
VI + Vn? — 2rim} " 
where V;, V,, are the coefficients of variation of 1 and m. For equally 
close correlation of 1 and m with 1/m we must have 





Tl/mm = 





Vi — Va> 


that is, equal coefficients of variation. In actual anthropometric work 
we rarely have either equal standard deviations or equal coefficients of 
variation. It is true that in some groups of anthropometric measure- 
ments the dimensions with bigger means have bigger standard deviations 
(for example, stature, arm length, biacromial diameter) but increase in 
standard deviation does not anywhere near keep pace with increasing 
mean. Consequently the measurements with the largest means, such 
as stature, have the lowest coefficients of variation. Taking //m instead 
of (1—m) to represent the 1, m relationship thus grossly overcorrects 
for the variances of the two measurements not being equal. We are 
out of the frying-pan into the freezer; somewhere in between lies the 
temperate zone. For instance, since stature has the lowest coefficient 
of variation of all the usually taken adult measurements, none of the 
per-stature indices (such as biacromial/stature), so dear to the tradi- 
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tional anthropologist, is independent of its two components. If the 
coefficient of variation of the other measurement is not equal to the 
correlation coefficient between the two, the ratio will increase or decrease 
as stature increases and decreases. For most per-stature ratios in fact, 
the ratio of the coefficients of variation is less than the correlation 
coefficient, and the per-stature index decreases as stature increases. This 
effect, it seems to me, nullifies in many or most circumstances, any 
usefulness the index may be supposed to possess. 

One possible way out of this difficulty is to relate two dimensions 
always in terms of their standard scores, such scores being of the form 
(1 —1)/s:, the deviation from the mean divided by the standard devia- 
tion. If the distribution of the measurements concerned is normal, this 
Studentizes them, i. e. makes them independent of scale. The procedure 
has been much used by psychologists and, in a slightly modified form, 
by the distinguished Italian school of constitutional anthropology. Such 
scores are often referred to as z-scores, and z; and z», may be related to 
each other either by subtraction or division without the result being 
dependent on / or m, since the variances and the means of all z-scores 
are equal. 

Translated into velocity terms, this approach would mean reporting 
increments of two dimensions in standard scores, that is as deviations 
from the mean increment divided by the standard deviation of increment. 
The principle could be extended, presumably, beyond two dimensions 
and might indeed provide the best means for comparing and plotting 
the velocities of several dimensions at once. So far as I am aware, this 
has never been done for growth data. Unfortunately the lack of any 
published standard deviations of increment except for height and weight 
(whose distribution is too skewed to serve) prevents an example being 
presented here. 

A second statistical analogy is with the analysis of covariance, though 
the problem for which the analysis of covariance might be appropriate 
is different from the last. We might frame the question of 1, m velocity 
relationship as follows: did 1 increase from age 8 to 9 more than m 
increased in the same year, each relative to their increments during the 
preceding 7 to 8 year? Proceeding without biometrical guide, we might 
try to answer this by comparing either the difference of 1 increments 
(1s,9 —:,<) with the difference of m increments (ms,9— m;,,). Or more 
probably by comparing the ratio of 7 increments I,9/l;, to the ratio of 
increments ms/m;s. Neither procedure would give the proper answer; 
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we have no knowledge that either allows correctly for the 7 to 8 year 
increments. The analysis of covariance specifically deals with this prob- 
lem; it works out the regression of the Ix9 values on the 1;, ones and 
corrects the J., mean for this and likewise corrects the msg mean for 
the m7s, Msg regression. The two corrected means are then compared. 
The procedure can be generalised to cover more situations likely to arise 
in handling growth data. Corrections can be made for mere than one 
previous value. Corrections could be made for another dimension n, 
instead of, or in addition to, correcting for previous values. Comparisons 
could be made between more than two measurements at a time. Thus 
if one suspected that in a certain year a particular dimension was 
growing faster than a group of others (say leg length relative to body 
widths from 8 to 9) an analysis of covariance could be done, allowing for 
either previous increments, a third group of bodily dimensions or both. 


More than two dimensions 


Ratios such as 1/m compare only two dimensions at a time, and 
though it is true that a large number of such comparisons may be made 
to yield, subject to the bias mentioned above, a fairly general picture 
of the whole body, a technique for comparing simultaneously more than 
two dimensions would be exceedingly weleome. The standard score tech- 
nique should accomplish this, and perhaps the covariance technique to 
some extent in the questions it is designed to answer. Another, and very 
interesting, approach would seem to be through the intercorrelation of 
the increments. If the velocity curves of two dimensions were exactly 
parallel, their correlation would then be 1, and deviations from parallel- 
ism, at least under certain circumstances, will decrease the correlation. A 
matrix of such correlations for a large number of measurements can 
be constructed and factor-analysed. Studies still in progress using 
mean increments indicate that this is quite a fruitful method. There 
are various difficulties to be faced, chief amongst them the difficulty 
or impossibility of estimating the standard errors of the correlation 
coefficients, since the velocity curves are autoregressive. There is also 
the objection that a matrix of mean intercorrelations is not identical 
to a matrix of the mean values of individual correlation matrices. The 
former is relatively easy to compute, the latter almost prohibitive; the 
relations between the two remain to be worked out. 

Another approach is the method of co-ordinate transformations due 
to D’Arcy Thompson (25). There would have to be, I imagine, several 
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successive transformations to cover the growth of a human from birth 
to maturity. There might be a first general one stretching from birth 
onwards and altering the infants proportions to those of the adult. The 
transformations representing the shape factors would interfere with this 
general transformation, more at some periods such as the mid-growth 
spurt and adolescence, than at others. The final transforming co-ordi- 
nates would be rather wavy, suggesting the analogy of a compound energy 
wave before Fourier analysis. One can imagine a system of electronic 
lenses which would produce from a baby’s dimensions, or even, shall we 
say, from a 3-month-old foetus, pictures of adults of every conceivable 
build according to how the lenses were set. (One can imagine too the 
chief use to which such machines would undoubtedly be put.) 


Multiplicative velocity 

We have dealt so far with simple velocities J and m. It has often 
been proposed, however, that rate of growth may be more usefully 
described by using velocities proportional to the size of the dimension 
that is growing. Such a velocity is 1/1, and has usually been called the 
specific growth rate. This seems to me an unfortunate name since it 
becomes very confusing in circumstances where different zoological species 
are being considered ; the word “ specific ” then gets used in two quite 
different senses at the same time. I propose to call this formulation 
multiplicative velocity, since the idea of its use is rooted in the concep- 
tion that a cell divides into two parts, each of which can divide into 
two further parts, and so on. Thus growth might be expected to be 
geometrically progressive, with / proportional to 1. If so, it is argued, 
it would be better to divide / by 1 when comparing it with the growth 
of dimension m. Otherwise, if 1 is bigger than m, and its constituent 
cells are dividing at the same rate, / will appear to be growing much 
faster since it has progressively more and more cells to divide. But if 
l/l is compared with rn/m then the actual rates of division of the cells 
of each are compared, and this is what is held to be important. However 
only in the simplest biological situations does each of the products of 
all divisions actually go on forthwith to division itself; differentiation 
of cells occurs and mitosis and the differentiated functions are thought 
to be in general somewhat opposed one to another. In a bacterial culture 
derived from a single cell the daughter cells all divide again at approxi- 
mately the same rate, say once every 20 minutes. This system is a 
homogeneous one. But in a multicellular organism there are many 
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different sorts of cells, each dividing, very likely, at different rates and 
providing a thoroughly heterogeneous system. Even the heterogeneity 
is not constant, since different sorts of cells very probably differ in repro- 
duction rate from one period of growth to another (30). Only during 
foetal life and during the general growth spurt at adolescence does a 
master chemical reaction appear which at any rate to some extent causes 
a general cellular reproduction. For similar reasons, metazoan growth 
cannot be simply compared to the growth of a human population, which 
is again unimodal and moderately homogeneous in reproduction rate. 

It must be realised of course that there is nothing necessarily wrong 
or incorrect in using either simple velocity or multiplicative velocity 
curves; the choice depends on the relative usefulness in the case in 
hand, and it is this that is debatable. At least in the growth of external 
dimensions in the human, / is by no means directly proportional to 1. 
If it were, the correlation between the ¢, to t. increment and the ¢, size 
would be unity or close to it, so long as the ¢, to ¢, interval were not too 
large. Data on this point are available (6, 16,17, 27) for height, weight 
and hip width, and in the case of the first the correlation is even below 
the level for significance, from birth to the beginning of the adolescent 
spurt. During adolescence some correlation exists, first positive, then 
negative. Weight increments do correlate to some extent with weight 
achieved, but quite apart from the association not being tremendously 
high, weight is a thoroughly atypical measure of growth in external 
dimensions or form. This is scarcely surprising since weight is not an 
external bodily dimension ; its inclusion on the same footing as the linear 
measurements in correlation tables and so forth seems to me most 
questionable. 

If multiplicative velocity is adopted for plotting, then the logarithms 
of the raw figures should be tabulated, and the ¢, to ¢, increments in logs 
plotted, since 1/1 = d(log1)/dt. Note that this is not the same thing 
as plotting the log of the straight increment, or plotting increments on 
log paper. ‘Turning the original observations into logs is a more legiti- 
mate way of calculating the multiplicative velocity than taking the ¢, to 
t, increment and dividing by either the ¢, value, the ¢, value or the 
average of the two values, since the ¢, to t. velocity may not be a straight 
line (see 18,19). 

In Fig. 5 are plotted straight velocity and multiplicative velocity 
curves for some of the Brush Foundation measurements (5). These 
mean increments are subject to the errors of cross-sectionally reported 
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Fic. 5. COMPARISON OF VELOCITY AND 
BrusH FOUNDATION DaTA, SIMMONS, 1944. 
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longitudinal series discussed above, but this does not affect the present 
general argument. It is at once apparent that in the graphs of bia- 
cromial diameter, bitrochanteric diameter, and stature, the two lines are 
practically parallel. The effect of using multiplicative velocity instead 
of simple velocity is to decrease the magnitudes of the mid-growth and 
adolescent spurts slightly, a result that follows from these periods being 
the time of highest correlation of J and 7. In the first year or two the 
reverse happens, and the multiplicative curves emphasise the deceleration 
more than do the others. The differences between the two however are 
really quite trivial. In the case of body weight, the last of the four 
graphs, the difference is of an altogether different order. Here the 
multiplicative curves have the same effect as before, but on a much 
larger scale. The simple velocity curve of weight looks very different 
from the curves of the linear dimensions, but the multiplicative curve 
belongs very much more with the linear family. Perhaps there really 
is some advantage in using the more complicated form of velocity for 
weight; there seems none for the linear dimensions. The figure empha- 
sises how rash it is to generalise about growth, and more particularly 
about growth of form, from a study of the growth in weight of an 
organism or an organ. 

There seems therefore no very good reason for making the trans- 
formation, as it is called, from ordinary measurements to logarithms in 
work upon external human dimensions. It merely adds a certain amount 
of complication to the subject, and diminishes relatively just those phases 
of growth that claim the greatest attention. In statistical work there 
are several good reasons for transforming given data into logs, chief 
amongst them skewness with a long tail to the right which gets corrected 
towards normal by the transformation,—a situation arising when the 
magnitude of experimental errors increases as the size of the object 
measured increases, and when, in repeated samples, the standard devia- 
tions of the samples are correlated with their means. These things do 
not happen to any extent in human external dimension data, though they 
may with measurements of such things as subcutaneous tissue thickness. 


Allometry 

If multiplicative velocity is accepted, however, as preferable in a 
particular instance, then the relative rates of growth of dimensions 
1 and m are given by i/1/rn/m or 


d (log 1) /dt 
d(log m) /dt ° 
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This time it is this ratio that is plotted and examined as to whether 
it increases, decreases or remains constant. This ratio is equal to k 
in the allometry formula 1 = bm*, developed by Huxley (20) and others. 
In fact the allometric formulation of the relative growth of two parts 
of an organism was specifically designed in accordance with this postulate. 
If we take logs of the formula and then differentiate with respect to ¢, 
we have 


log 1 = log b + k log m 


d d 
7 log 1 =k Tt log m 
d(log 1) /dt 


d(log m) /dt - 


It was originally contended that k (also called by some authors a) 
remained constant in many or most circumstances, or at any rate changed 
only once or twice, and that abruptly, during the period of growth. It is 
now clear that this can no longer be maintained; in most mammalian 
situations k varies with the passage of time. A graph of & against 
time simply plots the ratio of two multiplicative velocities: such graphs 
are to be found, for example, in the exceedingly valuable article by 
Richards and Kavanagh, which contains the best discussion to date of 
the whole problem of allometry and relative growth (21). 

In Fig. 6 several graphs of this sort are presented for human 
dimensions, using the Brush Foundation data, subject to the inaccuracies 
already discussed. The solid lines plot simple velocity ratios 1/ra which 
I have called j, and the dotted lines multiplicative velocity ratios, or 
Huxley’s k. That the two formulations tell very much the same story 
is again obvious. Equally so is the fact that & varies with time and has 
its own particular growth curve according to the dimensions considered. 
The upper four graphs for instance, show that at adolescence the bitro- 
chanteric diameter grows more rapidly, both in the simple and in the 
multiplicative sense, than does stature or biacromial diameter. There 
is a hint also that bitrochanteric diameter may grow more rapidly than 
biacromial during the mid-growth spurt. The stature/weight lines (in 
the lower right hand quadrant) decrease fairly regularly. Only in the 
sitting height/stature, in these examples, does k seem to remain sensibly 


constant for a few years, at least from 5 to the onset of adolescence. 
The last example gives us some insight into the relations between 
allometry and the results of factor analysis of the adult dimensions. 
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Solid line velocity ratio, dotted line multiplicative velocity ratio. 
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The factor analysis leads to a general size factor and several shape 
factors. It has been shown that the shape factors reflect periods during 
growth when one set of bodily dimensions grows more rapidly than 
other sets (22). At adolescence, for example, there is a large spurt of 
sitting height and little of leg length: and at just this time a particular 
group factor for sitting height emerges in analyses done for each yearly 
age group (23). The graph of sitting height/stature shows the enormous 
increase in the value of k just when this factor appears. So long as no 
shape factors are operative, it seems that k stays constant, at least for 
certain pairs of more or less comparable dimensions. In fact, k was 
designed as a constant specifically on the assumption that changes in the 
rate of self-multiplication affect all parts of the body equally (24). 
It seems, then, that so far as k& is constant, it measures the general factor 
of size, in the multiplicative system; and its departures from constancy 
measure the amount of differential development of the various shape 
factors, from between the two dimensions considered. Thus & stays 
more nearly constant for arm length/leg length where the advent of the 
leptosomie factor affects both similarly, than for arm length/cuest 
breadth, where, though the general factor operates on both, the lepto- 


somic does not. 
SUMMARY 


1. This paper discusses the biometrical handling of growth data, 
particularly those from a mixed longitudinal series, i.e. a series of 
children some but not all of whom leave the study each year, and some 
of whom, of various ages, arrive newly each year. Part 1 discusses the 
case of the single dimension such as stature; Part 2, the relations 
between two and more dimensions. An appendix gives detailed numerical 
calculations of the statistics recommended in Part 1. 


2. Part 1. Data are collected to test hypotheses. Three possible 
hypotheses are: (A) child P is abnormal in stature at a given age; 
(B) child P grew abnormally fast from one given year to the next, i.e. 
had an abnormal velocity of stature; (C) child P accelerated abnormally 
in stature. The general method whereby the physician or biologist tests 
each of these hypotheses is outlined, P’s deviation from the mean of the 
population of which he is thought to be a member being compared with 
the standard deviation of the population. It is emphasised that the 
data from normal children on which our standards are based are essen- 
tially sample data from larger populations. 
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3. The way to estimate the population mean for testing hypothesis 
A is detailed. Mixed longitudinal data must not be treated the same as 
cross-sectional data, or a large decrease in efficiency of estimation results. 
The various formulae for estimating population mean due to Patterson 
are set forth and their use discussed. 


4. The estimation of mean yearly increment needed for testing 
hypothesis B is discussed. The usual practice of reporting growth 
study data only in terms of overall means and standard deviations at 
each year is strongly condemned, and it is shown how a sensitive test 
of hypothesis B cannot be made using standards reported in this way. 


5. The formula for the best estimate of population mean increment 
is discussed, and it is shown that the estimate made by taking only the 
twice-measured longitudinal element of a mixed series and dropping the 
added cross-sectional cases entirely is nearly as efficient as the best 
estimate under the usual conditions of human growth studies, and simpler 


to compute. 


6. The advantage gained by having a series of measurements on a 
single child extending over a considerable number of years is discussed, 
and the conclusion reached that it can only be realised by fitting a 
curve to the individual’s measurements. The parameters of this curve 
are then compared with similar parameters for other children, thus 
testing a fourth and more critical hypothesis—that the child P grew 
normally over a long span of years. The point is made that the curve 
fitted to the mean measurements of a group of children will not in 
general have for its parameters the mean values of the parameters of 
the individuals’ curves. 


?. If the biometrical procedures outlined are used in reporting 
growth data, considerably more information will become available from 
growth studies than has hitherto appeared. However, because no bio- 
metrical procedure is final and the accumulation of longitudinal data 
is exceedingly costly and time-consuming, every effort should be made 
to publish or put on easily available record the original raw data of 
such studies, however excellent the biometrical workup. 


8. Part 2. The reporting of the relation between two dimensions 
as the ratio of one to another is criticized. The analogy of Studentization 
is used in discussing what function of the two dimensions best charac- 
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terises their relation, and it is suggested that this function is one that 
is independent of the scales of measurement of either dimension. 
Standard or z-scores are such a function and their use is recommended, 
Analysis of covariance technique might also be used in certain situations. 


9. The relations between increments of more than two dimensions 
considered simultaneously is discussed; factor analysis of increment 
inter-correlations, and a successive series of co-ordinate transformations 
are mentioned as possible approaches. 


10. The reporting of increments as multiplicative velocity curves 
(1/1) (dl/dt) is criticised and graphs given from human data showing 
the parallelism of such curves to ordinary velocity curves. The point 
is made that in general correlations between yearly increment and size 
at the beginning of the year are low; and the biological basis of multi- 
plicative reporting in the human is called into question. 


11. The question of allometry is discussed, and graphs given 
showing that in the human fk, the so-called constant of the allometry 
equation, and the ratio of two multiplicative velocities, is far from 
being constant during growth and plots a regular curve which closely 
parallels the curve for the ratio of ordinary velocities of the two dimen- 
sions considered. The relation of allometry in growth and the factor 
analysis of adult physique is briefly discussed. 


12. In the appendix a sample lay-out for reporting mixed longi- 
tudinal data is given, with detailed numerical calculations of the various 
estimates mentioned in Part 1. 
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APPENDIX 


NUMERICAL EXAMPLES 


In this appendix two examples are given of the full work-up, as 
detailed in Part 1, of data on a single measurement. The figures are 
presented in sufficient detail for the computations to be followed by 
investigators with a minimum of biometrical knowledge, or by secretary 
computers. 

The first example is taken from Meredith’s data (35) on the growth 
of the upper limb (acromion process to tip of middle finger, arm extended 
down the side of the body) in girls from age 3 to 12. The subjects 
were healthy children attending the University of Iowa Preschool Labora- 
tories and Elementary School during the years 1937-1946; they were 
mostly from the professional and managerial socio-economic group. These 
data are rather highly longitudinal and provide a somewhat simplified 
application of the methods of Part 1 in that, as can be seen from Table 4, 
new children are only joining during the first three years. From 6 to 8 
the data are completely longitudinal, and at no year do new children join 
and old children leave. 


The second example is drawn from Maresh’s study of the growth in 
length of the humerus (36). The bone was measured by X-rays, the 
focus-film distance being 74 feet throughout, and the humerus measured 
from one epiphyseal line to the other along the long axis of the shaft. 
The children were healthy girls attending the Child Research Council’s 
study of growth at Denver, Colorado. The data are for the most part 
reported over 6-monthly intervals and I have followed this in the analyses, 
omitting in Table 6 a few values which related to intervening 3-monthly 
measurements. As can be seen from the table, this data is much more 
fragmentary in the longitudinal sense than that of Meredith, and pro- 
vides a more complete, and slightly more complicated application of 
the methods of Part 1. 


The publication of actual raw data in growth studies is still a rare 
event, and one must be most grateful to the authors for making these 
series generally available and even more so for the foresight of the 
editors who managed to get them published. 
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Notation 
We repeat here, with some additions and omissions, the notation 
given earlier. 


For all subjects present at t,: 
N, = number of subjects; 
=r,, Sz,” = sum and sum of squares of measurements ; 
m, = mean of all measurements ; 
Vp = (2, — m,)*/(N, — 1) = variance of all measurements; 
Sx = Vv, = standard deviation. 
For subjects present at ¢,; who were also present at fy_1: 
Ly, = number; 
Ih, Sl, = sum and sum of squares of measurements ; 
‘ a 
l, = mean; 
V (l,) = variance. 
For subjects present at ¢, but not present at t,_,: 
Ch, SCa; ae *, é,.* Ven). 


We shall also need to consider the measurements of subjects at ty, 
who were also present at ¢;. We denote these by primed symbols 


D’y-1, } | oe ete. 


Finally, we shall need the covariance of the measurements of subjects 
present at both ¢,_, and ¢,: 


cov (U'n-1, ln) = S (Un —Vna) (a — lh) / (Tan — 1). 
In actual computation, we work from the relation 
S(U na —V na) (a — hh) = Saal — Waa h/Tn . 


Example 1. The layout of data and basic calculations in Tables 4 
and 5 is the one we have found most convenient in computation. The 
arrangement of the raw data follows Meredith directly and requires no 
comment. The next two blocks, sums and sums of squares of values, 
are best taken together with a calculating machine. Such a figure as 
831.4 for 3I,, for instance, represents the sum of the age 5 values of 
those children who were present at both ages 4 and 5. Similarly, the 


5 In Tables 5 and 7, J, is used in place of In; cn in place of &. 
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Length of upper limb (cms.) age 3 to 12 years in healthy girls; data from Meredith 1947 
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figure 384.8 for Xc, is the sum of the age 6 values of children who were 
not present at age 5. The quantity Sz, refers to all children at time h, 
so that the relation 

=r, = Sl, -f- Cr 


provides a check on the arithmetic. The same remarks apply to 32,?, 
etc. (The Sc,* values are not needed subsequently for any of our main 
purposes and could be omitted from the table; we have retained them 
in the general layout, however, as they may sometimes be needed, and 
it is of some interest to compare the variances estimated from the cross- 
sectional and longitudinal elements.) 

The next block introduces the only possibly confusing element in 
the table. This refers to the children present at t,. who were also 
present at ¢,. That is, in terms of the age column fy, the J, (unprimed) 
values throw back, the /’, (primed) values throw forward. For com- 
putional purposes, it is easier, and more logical, to put 3I’,_,, ete., in 
the ¢, column than in the ¢,_, column. Thus the figure 772.0 is the 
sum of the values at age 4 of those children present at both ages 5 and 4. 
The sums of squares, 3/’,_,? are similarly treated. 

The last row is needed in getting the correlation of the values at 
t,., and ¢,. It is simply the sum of products of the /’,_, and J, values. 

In Table 5, the first block consists of the numbers of children, Ly 
being the number present at ¢, who were also present at t,,, Cy the 
number present at ¢, who were absent at ¢,_,, and N, the total at t). 
The longitudinality fraction, A, is calculated from L,/N,. The next 
block is of means, recorded to four decimals to prevent rounding errors 
when they are used in further calculations. Then comes a block of 
sums of squares of deviations from means and of cross-products of devia- 
tions from means. These are calculated from the sums and sums of 
squares of Table 4 by the usual methods. The first figure in the 
(1, —1,)* row, at age 4, is obtained as 


24081.53 — (559.3)?/13 — 18.7231. 


The calculation of S(c, — é,)? may be omitted unless the cross-sectional 
element variances are required for some reason such as the computation 
Of Miscei.2 - 

At the end of the rows for 3(l,—J,)?, ete., at the right-hand end 
of the block, are recorded the overall sums of these rows. These are 
used in computing the overall r; thus 











150 J. M. TANNER 


716.5161 


overall r = ~ 7e75 9441 y¢ 793.8091 9 '81- 


The next block records the variances and covariances, obtained from 
the figures in the block above by simple division (using L, — 1, etc.) ; 
thus V(l,), for example, is 18.7231/(13—1) —1.5603. The only 
statistic in this block actually needed in the most usual further compu- 
tations is V(1,), and the others may be omitted if desired. 

The next block consists of two rows only; the upper records the 
regression coefficient for each set of values at ¢, on the corresponding 
values at ¢,,, and the lower the corresponding correlation coefficients. 
Both should routinely be calculated since only by looking at these 
figures can one decide if the conditions of equal r for each tar+ 
pair is met in the data in hand. Note however that the figure for 
“overall r” is not obtained by averaging the r’s given here, but by the 
process described above. The formulae for the regression and correlation 
coefficients are the usual ones; the regression of age 4 on age 3 
values is 13.9646/12.8369 — 1.088, and the correlation between the two, 
13.9646/ V 12.8369 & 18.7231 — .961. It will be seen that in the present 
example the correlation between ages 3 and 4 is rather low, and that 
the correlation drops also as puberty is approached at the other end of 
the series ; but the effect is probably insufficient to invalidate the further 
computations, in all of which the overall r of .9781 is used as an 
estimate of the population figure. 

The penultimate block contains the statistics we chiefly require to 
know. The first to be computed are (1— gq), and ¢, taken together, 
the former being first computed from equation C, that is 























L 
1—¢ = Snivas » Cnr #0. 
N —— 20 1—-= = L 
h rCn( Cn. n) 
For age 3, Lx, = 0 and (1— ¢,) = 0, and ¢,=—1. For age 4 we have 
13 
(1— gr) = . Tc 13 me 1222 ; 
18 — .9781? & 5(1 ee ) 
gra = .2778 
For age 5 we have 
18 
(1—¢n) = — = .6429; 
28 — .9781? x 10(1 =X") 


gn = .3571. 
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For ages 7 to 12 we have C, = 0 and (1— ga) = 1; dx = 0. Note that 
when both Ca..—0 and C,;=—0 then (1—¢,) —1 and the compu- 
tational complexity of equations C’ etc. does not arise; it only occurs 
when Cy-1 = 0 and C,~0. A case of this occurs in Example 2 at age 6 
and is illustrated below. 

The successive values of M, are now found from equation B, that is 


TS), 





M, = (1— da) {ha + 2. —In+)} + dnén. 
1 
At age 5 for instance, we have 
M, — .6429(46.1889 + .9781 V2:2610 (49 8778 — 49.8889} 
V 2.7069 


+ .3571 & 45.9700, 
= 46.1043. 


Note that /’,_, is in the same column of Table 5 as ga, Gy, Ma, etc., not 
the preceding one. Very particular care must be taken in calculating 
and checking the values of (1—¢n) and of M,, as any error upsets 
all succeeding values; it is best to compute and check twice in sets of 
not more than five values at a time. The next row 4,;My, is now taken, 
using equation F, that is 
neal, — My — Pat ah, — tana), Cae 
Sain 


At age 5 this time we have 


V 2.2610 


x 
————— >& — & .3571{48.9417 — 48.1000 
V 2.8860 x 10 x { } 


nei, = 46.1043 — .9781 
= 45.8955. 


Note again that though at age 7 and over we have C, —0, we do not 
have to use equation F” to evaluate 4,:M,, as Cr. = 0 also and thus 
the second term disappears, leaving ».:M@, —M,. For an instance when 
Cy =0 and Cy,, #90, see Example 2. 

The best estimate of mean increment that can be obtained by these 
procedures, M), — My, is then written down; note that the 4 to 5 year 
increment in this terminology is recorded under age 5 (h = 5 years) 
and the subtractions run diagonally, i.e. 


Increment 4 to 5 years = 46.1043 — 42.7980 
= 3.3063. 








: 
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The last three rows of the block contain the variances of these 
estimates of mean and mean increment. The variance of M, is obtained 
from equation H, if C, 0, that is 


V(M,) == Vadn/Cr . 


V(M)) at age 4 is thus (2.7069 & .2778) /5 = .1504. 
At age 7 however C; = 0 and we have to have recourse to equation 
H’, that is 





r?V (My) 


Vha-1 


oS een 2 
V (Mh) =o} "+ 


so that at this age 








; nwag ( 1—.97812 . 97812 & .0796 
igen) > See 36 2.8660 

= 3.3768 & .027775 (this figure being V( Mn.) /vn-.) 
= .0938. 


The variance for ages 8 to 12 are calculated similarly for each V(M,), 
the V(My-,)/va-. having been recorded in the previous step. 

The V(auM,) can now be recorded, if it is required. Equation I 
is used, that is 





V (nerMa) aia UnPr — r?Cnsrba(l — dns) } 


Cx Ch 
At age 4, this is 
V (nerMn) = .1504{1 — 9781? XK 10/5 X .6429 X .2778} 
= .0990. 


At age 3, (1— ¢n) = 0 and thus V(n,,¥,) = V(Mp), and at age 6 
and beyond, Cy,, 0 and the same is true. It is clear, however, that 
the estimate ,,,M@), at ages 4 and 5 is better than the estimate M,. 

Lastly the V(M,—,M,-.,) is recorded, from equation P’, in the 
present instance, that is 


V (Mn — 2Man-) = V(Ma) + VOaMan) — or (1 — gn) V(Mi.,). 
PhA-1 
At h = 6, the figures for the variance of the 5 to 6 increment are 
V 2.8660 


V (My — aMp-1) = .0796 + .0635 — 2 & .9781 K — od 


2.2610 





xX .7778 & .0807 


= .0048. 








ese 
ed 


on 
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In the last block are recorded for comparative purposes the longi- 
tudinal element mean increment 1, —J,.,, and its variance from 
equation R, that is 


Vln — Vn) = (V(t) + V (Una) — 2 cov(In, Vn1)}/Ln, 


and the mean increment estimated from the present and previous 
occasion only, Mincri,2 from equation N that is 


A(l, —V'n-1) + B(&— @n+) 
M iner1,2 = A +B ’ 





with 
Vien) , Viena) 
_— Ch + i 
_ Vr) + V (Un) — 2 cov(In, Un) 
Tn ; 
and its variance, from equation O, that is 


AB 
A+B° 


The computations for the age 5 to 6 increments are as follows 


A 








B 





V (Miner 1,2) — 


1, —Vy_, = 49.1893 — 46.1107 
= 3.0786, 


2.4432 + 2.2610 — 2 & 2.2972 


V(h—Vna) = 3 





— .0039, 





§ 3.8514 1.1401} § 2.4432 + 2.2610 —2X2.2972 
} on 


“= + (49.1898 — 46.1107) + ; } (48.1000— 45.9700) 

















ls ae — 10 28 
— f 3.8614 , 1.1401} | | 2.44884 8.9010—3X2.2978 | 
13 10 St! 28 
= 3.0724. 


The content of the second curly bracket is V(l,—I’n.), calculated 
already, but this will not usually be directly available under the ordinary 
circumstances in which Miner1,2 would be used. Lastly 


(108.8514 + 8X1.4011)(2.4432 + 2.2610 — 2X 2.2972) 
28(10X3.8514 + 8X1.4011) + 10X8(2.4432 + 2.2610—2 2.2972) 





V(Miner 12) = 


= .0039. 


Example 2. The data from Maresh’s study are reproduced in Table 
6, and in Table 7 the various statistics are displayed. The layout 


6 





— i cantante ma A 











J. M. TANNER 


154 














86 FFT 09°9661 PLSZZI 61° L292 PL ELzZ% 9¢°S661 ZS S9ET 69° LILI 88 TS61 g0'609T 19929 00°0 yr" = 
Te F6el ZU 1I6r €8'6L11 LZ 1962 EVLLIZ EL 9Ss8T 8L OFZ OF 06ST OF 6LZ21 L9 OEF1 TL2l¢ 00°0 a <4 
GPL 8°90I ¢'s9 £°Srl L6€I 1'0eT 8 E91 0'9ZI OCST £°9e1 £'e9 00 rW)sz 
OS Z6FT LE LEvs 1Z°8092 LE°SL6Z Lz°9L9Z% 6£° 689% 86 0FLZ ZO'SLES L8°69%2 99 19ES RP 6FST 88682 2 
00°0 bP ese 69° FEET bP 792 92°08 £o'Sze 9E'EbS 69°6¢¢ 89° LZE co'1e¢ 96'6FL 88682 319 Z 
OS L6FI £6 E80 SVELZ1 £6'60L2 IG’ ELes 9S OLS 69° L642 teeest 6 SEIS 19 OISI 29°66. 00°0 37 = 
ZLL £081 TPP Tell £°eel TOLI o'Sst OLLI £261 2 00% a Let 888 Zz 
00 88ST 6ZL 291 PLI oz 91 6°0F 9'Sz 6°9F PSL 888 Wg 
LL It ZL 6'SeT 6 ctl 9 LET 6ZLI 191 L99I feet SPL 00 "1S 
SLI rE 
fst Lli OLI 2°91 €e 
8! &, &! v! ze 
OLI OLI POL I¢ 
ai os ¢6l oe 
T'0z 261 esi LI Lgt 6z 
L002 661 8st est PLI op Oe 8% 
9°02 661 261 ess Ss - os Lz 
ris 691 2 9r gel SFI im ai eer 6 ZI 92 
sl ae e°L1 POL Sel Zr Zel SZ 
es OLT gol O9L 2st Vt Zl bz 
ZS Li Lgl 9¢eI 8 bl a PCI £3 
OST OLI POI Lgl Ltt 9eI Sl rd 
gor Tot ol £1 vel 8 ZI aes ve 1Z 
O'9T Ze ial 81 oer Val Zt OO! 0% 
i ae g'cl ol 8'eI 6 ZI 4 tt POL 61 
O'sT £1 9EI Lat git 601 +O * SI 
mn 6FI Tel gel zal 9 IT Por LI 
~ £+1 ba oa | 9ZI 9 IT £01 9I 
OFT Ser 6 ZI SII gor cI 
on : Z et Sl a tl 
6 ET Tel vol Zt 90 06 eI 
: a Sie “be : 601 06 ZI 
6ZI 9 IT gol 88 It 
: ie OCI Ol 68 or 
ets Lal OL o'6 6 
: Tet £31 Lot 0'6 8 
gel ara | 901 16 Z 
: os Vil L'6 - 9 
3 gel fst Tit ¢ 
v! ¢| I ¥8 e 
4 : 0% g 
Ts I 
9 “ee ¢ “YP ¥ “KE £ Kz Zz “I I &% ugaWoN 
SUV@A NI GOV Loarans 





(98) yseuv py Worf DyDp /8)410 Aynvey ur suvah |] 07 SYyUoW g WoLs (AD4-x fig paansvew ‘sw2) sniauny fo yjOua'T 


% AIAVL 








a> 
Eton) 
re 


REPORTING OF GROWTH DATA 














cO'Is6s = EZOTI «= «TO'SOGT SC OTOSLE §=—s «SHES «=—s«sIG'ZIIO «=e eu'gRog «= s«ON'ZTEb «= SO'OFZE = «OOW'AGTIT «COM TStSisCE'VGIT IIE 
OZIS8S COGIT  OG'SRPS  ZLIPOE  BELETS SORES  SHIICG ZHOSIb LZ9I9E  BZSIIZ  2O'66HS I9Z9IT es 
0° 201 8h 9°66 O'LFI O'L1Z $'992 19% 6'E61 6'691 9°Z01 Zzel 6'89 m4 Zz 
BS'ZZ0E = TISST «=: OO'EHIT «= GHTIOGH «= GTIZSL «= «OT'SOGL «=sCOSEFEL §«=s«OL'ELES «=: GH'LIED §«=—- HOLE «= OHOIFE =: OF'EETE wz Z 
00°0 60'0I9T + —-00°0 60°0¥9 PSP9ES ELSLZI ISLET OFSZST Szgers  ZO'ZEZS  E9'FOL 09°016Z 9 Z 
SS'ZZ0E «= LTILZI = OO'EROT )«=sCOSIGSE }=—s«dTH'OSHS «= «ES'GSTD «= « GE'GOSS «= «HA'NPFH | CHZ'ZSBE |= «HG'SGIZT «= LG'SHOT «= «G'ZZZT 1 Z 
9'60I 0°01 9201 €°LL1 g'sIe PElE e'2ze 9°29 £°98Z Z'01Z 8'F9I 0'£0z WZ 
o'0 L'99 0'0 £°9% 0°46 ¢'08 189 9°29 £°O1l 9°S0I 1'6e 9'Z41 WZ 
9°60I £'09 9°Z01 O'ZST g'12z 6292 6 EST 0°00z O'OLT 9°40 L'gzt ¥'09 "Zz 
O22 ven — pore ; a - + ¢ . = = 
Ts eae ‘ = mn +k zc i me vee 7 
. ‘ pe a3 bs te e § : a 
; : 9% es oa ow e fe - 
ots ves our ves re fe ce i a ee ‘i }4 
0'0e 3°63 “* ee 8°SZ ee ee. oe e* * Ly 
. : a 9 ves ‘3 ow neh xe : 4 
: Lez ko 9% S'1Z £°1Z “i 8'6I dp oF 
9°92 9°92 0°SZ Lvz = nt 9°22 SIZ te pi th 
ste ¥'SZ L¥Z re id g'£% 62 612 9'1Z he o ey 
: = 6£Z 28% £22 S'1Z e'1Z 10 55 ze 
WLZ £°9z 8°SZ 2°SZ o'FZ OE oh ae 7 £02 ha 
‘ $°9Z £°9Z 8°FZ ard sb l'ez = as rs OF 
4 si Lz Z'tZ £'1z 6'0Z 2°02 ts Z'6I 6 
. £82 igs ¥2z 9'1Z 9'0z L6l £'61 Se 
UFZ rez 6°22 V2 Gz wie " 8°61 le 
Fas Tc 9'£Z £°8Z ead Ja £'1Z a - of 
gb: 8°82 ig 82 V'2z he ¥'0Z £°0% oe 
a V¥s ed €%z Liz O'1z 0°02 8°61 3 
9% Lvz Vz PES 9°22 8'1Z Zz 9°02 ze 
she sin ” aie " o'6r ‘81 O'sT 1g 
; , a Vez £°£Z Fa oc og 
me 7, oan = 
0°22 £'1Z 1z 
aI “IL II 501 or 546 6 48 8 KL L “9 ugaqWwoNn 
SUVA NI GOV Lograas 











(98) ysesD py woul DpDp /8)410 AyrDey Ur suDah ZI 0} SyyUOW g Wo4s (ADI-x Aq pasnsDvew “sw2) snuawny fo YyOUuIT 
(penuyu0D) 9 AIAVL 


























RS 
<>) 
= 
= 
~ 
~ 
= 
S 
































S100" 2g00° 8200" ¢¢00" Z100° 1g00° ££00° 6£00° 8200" 1900° 8200" (G-W—- DA 
o0sL9" €E8L" O¢L9° PPPS" 6889" ees" €RGL" OOLO'T S08T'T LLOe'T SZFO'T -y— 1 
8620" 6F10" 60Z0 800° ¢¢00" 0g00" 0f00° 1Z00° Z100° ¢100" 6000" (“"WY— ‘WA 
9sor" ¥£60 8920" 1FF0" 9EF0" 6820" 6F20° IZ10 1g10° Z110" 9900° 0600" (‘Wit’) 4 
SItl F260" 6820" I1g0" FORO" €LE0" 6920" SLI0° SFO" ZZ10° 1800" SFO" (‘W) A 
£189 g1eZ 61IL’ 998" SPOL 1006" 6818" 9FE0'T 6SEl'T $9621 8z99'l “iyi — Ww 
00Z9°6 S889'ST G296'LI T1t2L1 OLgE'9I OZOL'ST OS9Z' FI R016'EL ILPS'Z1 ZOPL IT LOSP'OL O108"8 vw ity 
10L¢°6 888981 O€S6'LT 9802 LT 890F' 91 1S99°ST LEZL FL LIgs’sI 19L8°ZI COFL IT Reor' Ol 0088"8 "W 
0000°0 IZ50" Slt" LEv0" 9740" 2Z80° 6160" ez" L060" rs ai) ColF 0000°T To) 
0000°T 6296" L¥Zz8 £996" F196" £L16" 1896" CFLS" £606" LSOS" GERg" 0000°0 ~ — I 
000°T 966" 866" 186° 66" 986" 0L6" OF6" 6L8° 618" PRL" = mv ‘W4 
Lg0'1 T80'T z96" 060°T rL6 890°T LOT o26" Slz'I gs0'T 839° = I-¥ “4q 
L9LE°% 008T'Z L9F'Z I8€0'T 09¢0'T #608" 166F" e20e" Ores" COFL OFE0" (¥7 ‘I-¥,)) aoa 
Z6bS'S 0910°Z 899z°% 6196" FPS0'I SLSL" FOFE 111g" gOS Leet C6F0" (-%)) A 
eles €886'T €108'I 669T'T ¢F90'I 906" LI¥9" Lyer PLZe" L092" z9eT" PSbl va 
“$e zy 2620°% “le Os6e" ¢ eK0'T CZRe" cé6xI PSP (9) A 
egies LLLET £eso'% PRL 98E0'T 98838" 8089" eres” 6188" PLES Is (1) A 
0¢z9'ST 0008" LT OSZI'LI SLLP'9I ZEz9'S1 OShF'FI 00¢9°ET 0009°Z1 £26911 OFRF'OI 62406 ; my 
000861 EFIO'ST SZLO'SI OOTS LI ooee’9T 9L9F'ST 000¢°FI PSI9'SI 00ZS°ZI COLL IT EFIS'OT 0088'S vu 
sis 0008'8T OSZZ'SI 0002'9T 000F LI 008291 0009°ST eeeg'el 000821 oOsZL'I1 62bE 01 OO8S'8 vo 
000861 £E8e'S1 0008" LI Z2ee' LT I11Z'9 6882'S €R0F' FI OOTO'EL 1eZs'Z1 E262 11 LS89'01 ” 
000°T Lss" 00s" 006" 006" sis" £26" 692" 198° L19L 00g" 0 Xx 
F L 8 or or Il gI eI cI LI FI ol N 
0 I ’ I I rd I £ z t L ol ) 
t 9 ¥ 6 6 6 ra or gI gI L 0 v7 
9 “KS g “uP ¥ Ke £ baad z KI if % 


SUVGA NI GDV 





(98) ysauv py woul vpvp /g9 9/qD., woul peyDpNz7D9 87410 Ur Sn4vwny fo YJOua] UO so1VE~VmY 


Z ATV 





157 


REPORTING OF GROWTH DATA 


aa 


or eS ee = 


a 


























$600" 00+0" £420" 8600" 1£00" $200" 1400" Z900° TOTO" 00+0" 120" ooro” (a — 1) A 
009 0006" 00g2" £882" 0009" LZL9" 1602" 8LL9" S792" 000%" £889" 0008 my 4 
2890" LEvI 8990" $10" 4600" 9£00° 9£00° 9900" $200" 1810" $610" gio) (“YW — *W) A 
OzPF" goss: LbLT" 2120" #850" £920" 9920" L810" 8F£0" 9640" £980" reso | WY) A 
OCF" SOse" 6681" L120" 6650" 1620" £220" 9020" GRE0" +690" 8F0I" 9790 OW) A 
19%" £1Zz't_ 9608" OIL’ cer «BESO O81" 8009" 2908" Z10F ¥S8b ae ate 
f08'2Z = SGTLT «= GUEG'ST = GLOSS «= ERTE'HS «= HRABTET «=—«OTIZ'ET «=—CHIIH'S «= GOERIZ «= T6001S Ss SETOST«LETOOS TY 
fv08'LS = SRST'ZS «= GBPS'ST = EGET = LOTE'HZ «= QGORET «=—sHETIET «= LIEW'GS «= RGTNIZ «= BOHG'OT = T6GH'OT ~=—81 80°08 ‘W 
00000 102T" 000006 ZE0" oget’ 2690" Lb01" $601 1881" 6Z1Z" 2460" gSze" e 
0000°T 668" 00007T = 1296" 0298" £086" £968" 9068" 6118 182" 8906" ¥bL9 %- 1 
6926" 
4 [[81249 F 
866" 000°T 266" 286" 626° 026" 696" 196" 926" 126" 416" 966" ean 
110'r 621° 6S1'I 200° 896" £10 696" 020°1 £0'l P88" 820° 10"! mw “Ng 
1909 OSzh'GeetHE |= ONO'S SCT TC" 0909" 629" F10¥" erect  ozsss =—_«OzOs'SC RBG (M7 “I"¥) 409 
z9ee'9 «SOR EESE'S ORGS ORGDT Z869" Zzo9" E92e" sort = Ozaweékawors:—<(iéiéOT' ND A 
ee1g'9 = 29889 «= OUSGE = SORT = OUST =— S088" 9782" sos" 9EZOT = ISSE'T «= LEZ BBETT ‘a 
0St9'Z a rm 9961°F  —— 0G09" 006c7% = SE0TT = ORO" 3 Oger" F276" (¥2) A 
ee1s'9 OszIo9 ooBe's ssezs  —«T TED" 0z99" 919g" PEL 629FT OLL9T EOS LESHE (2) A 
O0SL'9Z  OOSE'FZ HODGES  000N'FZ  IIITHS  ELZZET  LELETS POSIT  GLETIZ  OOZSOT  LOdFOS LEE! m4 
OO0FLZ  O0SL'9T  HOSIST  BSZE'ST  OOOSFZ  LLOIFZ  HZHO'ET OOOH  IEZO'ZS  OOZOTZ  0009'0Z  000EZ vu 
ES OOSE'RZ wh OO0E'SZ  OOLZHS  OOSTEZ  NO0OK'TZ LESS 0090'S  OOZI'IZ  OOSTEI FILEOZ ‘9 
ooo ze OOSI'SZ  OOSTET  LELESS IIMS  OOOKET  IBO'ET  SeZTZZ  000N'TS  0OZEOT  OOS6OS  LEETOZ " 
000° 00s" 000°! 298" 269" 98" 98z" ose’ S19" 00s" ose" oe" x 
b ¢ ¢ L eI gl ¥I at gl or 8 ol ‘N 
0 z 0 I ¥ z £ £ ¢ ¢ z L +f) 
¢ z ¢ 4 6 Ul I 6 8 S 9 £ ‘1 
a | “II Il %OI or 16 6 48 8 KL L %9 


SUVGA NI GOV 











(98) yseaD py wosf vpvp ‘9 21qD, , wosf pazppnozv2 87416 Ut snuawmeny fo Yy}0ua)] UO 82178170719 


(penuyu0D) 2 WIAVL 
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follows that of the previous example, except that the sums of the squared 
deviations have been omitted for economy of space and Minecri,2 and its 
variance are not given. The computations are similar to those in 
Example 1 and need not be repeated here. At age 64, however, there 
is an example of the calculation of 1— ¢, when Cy_, — 0 and C, ~ 0. 
The computation proceeds as follows: 

Since Cy_, = 0, we must use equations 0’, C” and C’”. We find first 





V (Mh-2) _ Pn-2 ead 
Vn-2 Ch-2 
.0469 
1 
= .0469. 


Then substituting this value in equation C” we have 


V(Mas)  1—?? V(M)-2) 
ED ene r? a ae Cc” 
Va-1 Nin - Vn-2 
— .9769? 
a 97697 &K .0469 


== .0562. 
Finally, substituting this value in equation C’ 


Th 
1 — pn = Ni — r=( nfl — Ly V (Mi )/Ur-a} 





we have 
3 
= 79 — 9769? X 1(1— 3 X 0562) 
— 6747. 





1 a. 


A similar calculation must be done to find ¢, at age 114 since Cy at 
11 is zero. 

The same difficulty arises in the computation of »,,My at age 6, 
when C,—0. We have then from equation F 


V 2.5133 


M 
ner, — 19.3701 — .9769 X ———— KIX V(is) {20.0818 — 20.3714}, 
V 1.9883 Vn 





| 
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and by equation C” 


V(M,) _1—.9769? »V (Mis) 
aR as Tf SO 





and by equation C’” 


V (Mos) __ Ges OFT oan. 


Whence 7A) = .0563 and »,,M, — 19.5200. 
‘h 
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METHODOLOGY IN THE STUDY OF PHYSICAL 
MEASUREMENTS OF SCHOOL CHILDREN * 


PART I 





BY B. G. GREENBERG AND A. HUGHES BRYAN 
University of North Carolina, Chapel Hill 


N PREVIOUS studies by the authors (3,4), it was recommended 
I that the technique of covariance be employed in the analysis of 
anthropometric characteristics of school children. It obviates the 
classical requirement for balancing large homogeneous groups of children 
according to age, height, or weight in order to ascertain whether or not 
observed differences in another variable are random fluctuations. The 
advantages of covariance in this type of application have also been 
pointed out recently by Kemsley (14) in a study of weight and height 
of a population. In the present study, it can be shown that the effect 
of adjustment for age by covariance was equivalent, at least, to doubling 
the number of observations. 

The ultimate aim of the continuing series of investigations is to 
examine the etiology, in terms of nutritional status, for physical differ- 
ences in school children of differing socio-economic backgrounds. In 
pursuing this objective, several problems in methodology have had to be 
resolved to assure maximum efficiency at each step. The purpose of the 
present paper is to report some of these results. They include the 
mechanics of the sampling used to obtain physical measurements, the 
optimal age group for study, and the physical measurements which 
appear to be worthwhile indices to adopt. 


a. Sampling technique used in obtaining measurements 


The following measurements were made on boys and girls: age 
(months), weight (pounds), height (centimeters), transverse chest dia- 


* A preliminary report of this paper was presented before a joint meeting of 
the Biometric Society and the Institute of Mathematical Statistics, March 17, 
1950, Chapel Hill, North Carolina. 
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meter at the level of the xiphoid (centimeters), bi-iliac diameter (centi- 
meters), and calf circumference (centimeters). Calf circumference 
may be interpreted as an indicator of the amount of muscle and sub- 
cutaneous fat, whereas the chest and bi-iliac diameters are gauges of 
skeletal stockiness. 

It was expected that there would be variability in these measurements 
due to the children selected, the observers (both within and between), 
and whether the children were dressed or undressed. A series of pilot 
studies were undertaken in which each of several trained observers made 
duplicate but independent measurements on a number of children. In 
certain of these studies, the children were first dressed and then 
undressed. Estimation of the separate components of variance was then 
made from each of these experiments. The results are listed briefly 
herewith. 

Except for the boys’ bi-iliac measurements, clothes made no signifi- 
cant change in the size of the differences between two group means. 
For example, one group of ten boys when dressed measured 0.8 centi- 
meters more than another group in the transverse chest diameter, whereas 
this difference was 0.9 centimeters when the boys were undressed. The 
change from 0.8 to 0.9 was not significant. On the other hand, the 
same ten boys measured 0.7 centimeters more than another group in 
the bi-iliac diameter while wearing clothes, and this difference decreased 
significantly to 0.5 centimeters when the children were undressed. In 
this manner, it was observed that the extent of the differences between 
group means would be little changed if all the children were dressed or 
undressed, with the exception of the bi-iliac diameter. The reason that 
the boys’ bi-iliac diameter measurements differed significantly might be 
explained by the variety in the types of trousers and coveralls worn at 
the time of measurement. (Of course, measurements must be taken at 
the same season of the year so that the type of garment worn will not 
otherwise disturb the equilibrium. ) 

Furthermore, the variance of the difference between group means 
was not significantly increased when clothes were worn. Such informa- 
tion was beneficial from the standpoint of both economy and avoiding 
the problem of convincing school authorities that children should undress. 
This was especially valuable in the case of girls. All subsequent measure- 
ments reported herein are based, therefore, on children who were dressed. 
Unfortunately, a correction factor for clothes cannot be assigned to the 
measurements reported in the main body of this paper, since the pilot 
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studies were undertaken with a different group of children and at varying 
times of the year. 

To minimize “ observer errors” in conjunction with the number of 
children expected to be in the sample at a later date, it was found that 
an overlap in measurement to the extent of one in three was desirable. 
That is, observer A should measure the first child, observer B the second 
child, and both A and B should take independent measurements on the 
third child. The only exception to this plan was in weighing children, 
where the almost complete absence of within-observer variability in the 
pilot study indicated such overlap was unnecessary. A description of 
the pilot studies and their composition is contained in the Appendix. 
The recommendation for a sampling design was based upon estimates 
of the components of variance derived from these studies and may be 
found in Table 10 of the Appendix. 


b. Optimal age groups for study 

The ages of children studied varied from 72 to 167 months (that is, 
6 years and up to but not including 14 years, which for brevity will 
be referred to as “6 thru 13 years”). This is a group of children 
ordinarily enrolled in an elementary school. Using the sampling plan 
in (a), two schools for white children in North Carolina from differing 
socio-economic backgrounds were studied. Children in school A came 
from urban families with presumably higher than average incomes, and 
those in school B came from rural families in a cash farming region. 
There were 292 children in A and 154 in B, and it was believed that each 
constituted a relatively homogeneous group with respect to socio-economic 
level.* 

The problem arose as to which age grouping was optimal for showing 
physical differences between the children from the two schools. Too 
large an age interval would violate the assumption of homogeneous vari- 


1 It was assumed that genetic or racial factors would not play a major role in 
any physical differences such as weight and height between these two schools. 
These children are from a group of families indigenous to North Carolina for 
several generations. Furthermore, studies by Boas (2) on the relative role of 
heredity and environment indicate that certain traits, such as the form of the 
head and face are much more strongly influenced by race than are other features 
such as height and weight. Also, Stuart (15) has stated, “ Several recent studies 
offer strongly suggestive evidence that dietary habits or factors associated with 
socio-economic circumstances may have more to do with the differences found to 
exist between racial groups than has the factor of race itself.” 
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ances within the group. For example, it has been shown by Burgess (6) 
that the standard deviation and coefficient of variation for height are 
functions of age. Too great an age grouping would also result in having 
the variable under consideration deviate from linear regression on age. 
This was undesirable computationally, since it was hoped to derive a 
simple routine for broad application. There are numerous small sample 
nutritional studies where an experimental group receives, say, some class 
of vitamins, and the effectiveness of the therapy is measured by com- 
paring physical measurement against a control group. It would be 
worthwhile as a secondary objective here to develop efficient yet simple 
techniques of comparison for such projects. 

Optimal was defined rather loosely in terms of several considerations, 
viz., the coefficient of variation, the number of children who would be 
in that age group in the two schools, the magnitude of the differences 
between adjusted group means, and the probability associated with the 
F-ratio measuring the significance of those differences. It is clear that 
a particular age grouping might have a low coefficient of variation, but 
if there is no real difference between mean height, or mean weight, of 
children in that age group from the two schools, the F-ratio would 
probably be not signicant. On the other hand, a high F-ratio would 
not imply, per se, a difference between group means which would be of 
biological value. 

The various age groupings which were tried were 72-107 months 
(6 thru 8 years), 108-143 months (9 thru 11 years), 132-167 months 
(11 thru 13 years), and 96-143 months (8 thru 11 years). It was deemed 
preferable to choose identical age groupings for both sexes, since school 
health programs are conveniently administered through the classroom. 
The optimal division for both boys and girls, in terms of the above 
considerations, was the 96-143 months age grouping. 

For boys alone, 108-143 months appeared about as suitable as the 
larger group inclusive of the eight-year-olds. It was decided to choose 
the 96-143 months group, however, when consideration was given to all 
the measurements adjusted for both age and height. For girls alone, 
72-107 months seemed slightly more desirable than 96-143 months, 
particularly when all the measurements were adjusted for both age and 
height. The latter was chosen in spite of this to retain uniformity with 
the boys. Furthermore, there were greater physical differences between 
the mean heights and mean weights in the 96-143 months group, which 
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made it more favorable from the standpoint of relating growth to 


nutritional background. 


Both boys and girls in the 96-143 months age group in school A were 
significantly taller and heavier for their age than those in school B of 
a comparable age group. These results are presented in Tables 1 and 2. 


TABLE 1 
Observed mean ages, heights, and weights for children from 


two schools in North Carolina by sex and age groups 




















wingorengs OBSERVED GROUP MEANS 
AGE GROUPINGS OF CHIL- 
DREN Age (mos.) Height (cms.) Weight (lbs.) 
A B A B Total A B A B 
Boys 
72-107 mos. 59 31 91.9 93.8 92.5 125.8 124.2 57.5 56.0 
(6 thru 8 yrs.) 
96-143 mos. 87 47 119.3 117.1 118.5 139.4 135.6 77.2 67.0 
(8 thru 11 yrs.) 
132-167 mos. 55 31 147.8 149.4 148.4 151.4 148.4 93.2 89.9 
(11 thru 13 yrs.) 
108-143 mos. 66 30 124.6 125.2 124.8 142.3 138.4 81.1 69.9 
(9 thru 11 yrs.) 
Girls 
72-107 mos. 49 20 90.5 89.3 90.2 125.0 121.8 58.0 49.6 
(6 thru 8 yrs.) 
96-143 mos. 72 44 #121.1 122.1 121.5 140.6 137.7 78.1 70.8 
(8 thru 11 yrs.) 
132-167 mos. 46 29 145.7 144.7 145.3 152.0 148.7 95.7 89.7 
(11 thru 13 yrs.) 
108-143 mos. 59 38 125.2 125.5 125.3 142.9 139.4 80.4 73.6 
(9 thru 11 yrs.) 





It can be seen from Table 2 that not only were the differences and 
the significance of the differences frequently a maximum at these ages, 
but the children in school B tended to diminish the magnitude of this 
differential as they became older, especially with respect to weight. Thus, 
as the two differing socio-economic groups of children grew older, the 
difference in mean height decreased slightly, and the difference in mean 
This led us to believe that the older children 


weight almost vanished. 


in school B grew heavier for their age and height than those in school A. 
This was tested, and the results are presented in Table 3. 
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Although such a trend was borne out by the data in Table 3, the 
differences were not significant. That being the case, the conjecture 
was made that there are factors other than age and height which 
influence the mean weight of differing socio-economic groups. 

Based upon the sampling scheme in part (a) and estimates of 


TABLE 3 
Group mean weight adjusted for age and height 





WEIGHT (lbs.) ADJUSTED FOR AGE AND HEIGHT! 











AGE GROUPINGS F-ratio of Coefficient 
A B A-B difference of variation 
Boys 
72- 107 mos. 57.2 56.4 0.8 0.33 11.8 
(6 thru 8 yrs.) 
96-143 mos. 74.8 71.4 3.4 2.33 18.5 
(8 thru 11 yrs.) 
132-167 mos. 92.1 93.5 —1.4 0.15 19.7 
(11 thru 13 yrs.) 
108-143 mos. 79.0 74.7 4.3 2.05 18.7 
(9 thru 11 yrs.) 
Girls 
72-107 mos. 56.6 §2.2 4.4 7.54** 11.8 
(6 thru 8 yrs.) 
96-143 mos. 76.2 73.2 3.0 1.58 18.8 
(8 thru 11 yrs.) 
132-167 mos. 92.3 92.9 —0.6 0.02 19.7 
(11 thru 13 yrs.) 
108-143 mos. 78.2 76.3 1.9 0.44 19.4 


(9 thru 11 yrs.) 





1 The means were adjusted for age and height differences indicated in Table 1. 
** Highly significant at 1% level. 


variation from the two schools, a table was prepared to show the 
number of children 96-143 months old required to establish significance 
of differences in weight and height measurements. For example, to 
establish a difference significant from zero when the true differer-e is 
at least one centimeter in height, adjusted for age, would require an 
equal number of 378 boys in each socio-economic group, in comparison 
to 465 girls that would be required in each; a difference of two centi- 
meters would decrease this number to 96 boys, or 118 girls, in each 
group, etc. These results are presented in Table 4, with supplementary 
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information for unequal groups in which one might be twice the size 
of the other. In this table, there is also presented the minimum number 
of observations which will, with a probability of 95 per cent establish 
differences of certain stated sizes to be significant at the 5 per cent level, 
The size of these latter groups is correspondingly larger, since an alter- 
nate hypothesis is specified with the Type II error fixed at 0.05. Tables 
from the paper by Harris, Horvitz, and Mood (10) were used for the 
purpose of checking the results. 

In preparing this table, consideration was given not only to the 
variation in height (or weight) but also to the sampling error of the 
regression on age. That is, the “effective error variance” per child 
was used. An average, or expected, value of this effective error variance 
in terms of the degrees of freedom for error, suggested by Cochran (7) 
in 1940 and defined by Finney (9) in 1946, was found to be an excellent 
approximation in this case. 

It should be mentioned that 96-143 months is considered a good age 
range by nutritionists for obtaining fairly reliable dietary information 
from the children themselves (5). This was a fortunate coincidence 
of age grouping in this instance, since the hypothesis that physical 
differences of growth and maturation may be explained in part by dietary 
patterns is to be tested in later work.” 


* Since body measurements are the result of nutrition and other factors over 
a period of years, it would be most desirable to study dietary patterns in children 
prior to the manifestation of physical differences. When the diets of younger 
children (e.g., pre-96 months) are studied, however, the question is raised 
whether physical differences of growth will occur later. To answer this unequi- 
vocally, a long-term follow-up study of selected children is necessary. Under 
present circumstances, an involved study of this caliber was not possible. 

Suppose, however, that the physical differences observed in the present instance 
represent a characteristic of the age grouping 96-143 months itself and that such 
differences will be repeated in the future as other cohorts of children reach this 
age. In that case, it would be sound procedure to substitute for the long-term 
study by investigating the present diet in the pre-96 months group. It is possible, 
however, that the observed variation in physique applies only to the particular 
children involved at the present time. To determine which of these two alterna- 
tives is more likely would require waiting until younger children reached the 
96-143 months age group. From the standpoint of time involved, that solution 
would be analogous to a long-term follow-up study. 

Consequently, it is felt that a compromise must be made by studying the diets 
of those children who differ in physique now, viz., those in the 96-143 months 
group. To attempt an investigation into the nutritional history of this group is 
obviously impractical. Recognizing, however, that dietary patterns are habit- 
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It is also an age range in which sexual maturity is a very important 
factor determining growth. Indeed some or all of the between group 
differences in body measurements reported in this paper might arise 
from variation in the timing of the adolescent growth spurt in the 


TABLE 5 
Group mean bi-iliac diameter adjusted for age and height 





BI-ILIAC DIAMETER (cms.) ADJUSTED FOR AGE AND HEIGHT! 
F-ratio of Coefficient 





AGE GROUPINGS 








A B A-B difference of variation 
Boys 
72-107 mos. 20.5 20.3 0.2 1.29 4.3 
(6 thru 8 yrs.) 
96-143 mos. 22.4 22.2 0.2 0.65 6.0 
(8 thru 11 yrs.) 
132-167 mos. 24.1 24.5 —0.4 1.74 6.5 
(11 thru 13 yrs.) 
108-143 mos. 22.9 22.7 0.2 0.20 6.3 
(9 thru 11 yrs.) 
Girls 
72-107 mos. 20.1 19.8 0.3 1.67 4.8 
(6 thru 8 yrs.) 
96-143 mos. 22.4 22.3 0.1 0.20 5.2 
(8 thru 11 yrs.) 
132-167 mos. 24.5 24.7 —0.2 0.36 6.0 
(11 thru 13 yrs.) 
108-143 mos. 22.7 22.7 0 0.01 5.4 


___ (9 thru 11 yrs.) 





1 The means were adjusted for age and height differences indicated in Table 1. 


groups compared. We are now engaged in the study of methods suitable 
for use in determining the mean age of the pubertal growth spurt of 
groups of children. 


c. Indices of body measurements 

Using the age classification 96-143 months, group mean heights and 
weights adjusted for age readily indicated differences in the growth 
and physique of the two groups of children. For both boys and girls, 
these two measurements were highly significant in the power to dis- 
forming and remain relatively unchanged over short periods of time, it seems 
reasonable to assume that any dietary differences of former years between these 


two socio-economic groups would still be detectable with a fair degree of accuracy, 
In future work we will investigate, therefore, the diets of the 96-143 months group. 
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criminate between the two socio-economic groups. They are fairly 
simple routine measurements requiring little specialized training. From 
Table 2 it can be seen that children of both sexes in school A were 
3 to 31% centimeters taller, and over 8 pounds heavier than children of 
the same age in school B. 

The other measurements—chest, bi-iliac, and calf circumference— 
were also significantly different between the two schools when adjusted 
for age. The bi-iliac differed by approximately 34 centimeter, the chest 
by \% centimeter, and the calf by 1 centimeter. These differences con- 


TABLE 6 
Group mean transverse chest diameter adjusted for age and height 





CHEST DIAMETER (cms.) ADJUSTED FOR AGE AND HEIGHT! 

















AGE GROUPINGS F-ratio of Coefficient 
A B A-B difference of variation 
Boys 
72-107 mos. 19.8 19.9 —0.1 0.14 5.3 
(6 thru 8 yrs.) 
96-143 mos. 21.3 21.3 0 0.04 5.8 
(8 thru 11 yrs.) 
132-167 mos. 22.8 23.3 —0.5 2.51 6.7 
(11 thru 13 yrs.) 
108-143 mos. 21.6 21.5 0.1 0.18 5.6 
(9 thru 11 yrs.) 
Girls 
72-107 mos. 19.1 19.1 0 0.01 5.6 
(6 thru 8 yrs.) 
96-143 mos. 20.7 20.9 —0.2 0.77 7.2 
(8 thru 11 yrs.) 
132-167 mos. 22.0 22.4 —0.4 1.58 7.8 
(11 thru 13 yrs.) 
108-143 mos. 20.9 21.1 —0.2 0.83 7.6 


(9 thru 11 yrs.) 





! The means were adjusted for age and height differences indicated in Table 1. 


firmed the findings on weight and height but provided little additional 
information of discriminating value between the two schools. 

When weight was adjusted for both age and height, it was indicated 
in Table 3 that there no longer appeared a significant difference between 
the two schools. (Girls 72-107 months were an exception to this. The 
probable origin of this weight difference will be discussed presently.) 
Similarly, the chest and bi-iliac diameters were not significantly different 
between the two schools when adjusted for age and height. These results 
are given in Tables 5 and 6. 
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The calf circumference, nevertheless, did indicate a significant 
(P = 0.02) difference of almost 1 centimeter between boys in the two 
schools when adjustments were made for both age and height. The 
results are presented in Table 7. In fact, the difference between the 


TABLE 7 
Group mean calf circumference adjusted for age and height 





CALF CIRCUMFERENCE (cms.) ADJUSTED FOR AGE AND HEIGHT! 











AGE GROUPINGS F-ratio of Coefficient 
A B A-B difference of variation 
Boys 
72-107 mos. 24.9 24.3 0.6 3.68 6.3 
(6 thru 8 yrs.) 
96-143 mos. 27.6 26.7 0.9 5.12* 8.6 
(8 thru 11 yrs.) 
132-167 mos. 29.8 29.6 0.2 0.19 10.0 
(11 thru 13 yrs.) 
108-143 mos. 28.1 27.1 1.0 3.35 8.8 
(9 thru 11 yrs.) 
Girls 
72-107 mos. 25.2 24.3 0.9 6.71* 6.0 
(6 thru 8 yrs.) 
96-143 mos. 28.0 27.4 0.6 2.08 8.8 
(8 thru 11 yrs.) 
132-167 mos. 30.1 30.1 0 0.01 9.3 
(11 thru 13 yrs.) 
108-143 mos. 28.3 27.9 0.4 0.60 9.2 
(9 thru 11 yrs.) 





1 The means were adjusted for age and height differences indicated in Table 1. 
* Significant at 5% level. 


mean calf circumference of the two groups of boys was reduced only 
slightly when the correction for height was introduced in addition to age. 
The difference in the girls’ calf circumference between school A and 
school B was not as great as that of the boys in the age group 96-143 
months, nor was it significant. 

For girls 72-107 months, however, there was a highly significant 
difference of about 1 centimeter between the mean calf circumference of 
the two schools. It was felt that this might explain the source of the 
weight differential observed at this age when adjustments were made 
for age and height. When the two groups of girls were compared in 
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mean weight after adjustment for age, height, and calf, there appeared 
no difference whatsoever. 

These results with the circumference of the calf suggest that such 
measurement is possibly another index useful in pointing out physical 
differences between socio-economic groups when adjustment is made for 
both age and height. Variations in the muscle mass were not always 
reflected in weight changes after adjustment had been made for age and 
height. For example, boys 96-143 months differed significantly in the 
mean calf circumference but not in weight when adjusted for age and 
height. 


d. Use of factor analysis to measure an index of physical structure * 


It was indicated in the previous section that a suitable description 
of the physical characteristics of a group might be obtained by limiting 
measurements to weight, height, and calf circumference, with corre- 
sponding adjustment for age. It is of interest to see if this recom- 
mendation can be confirmed in studying the problem from another 
approach. 

The technique of multiple factor analysis devised by Thurstone (16) 
is familiar to workers in psychology and allied fields. Its applicability 
to a medical problem has been indicated by Andrews (1). It was 
probable that the method would also be of value in this instance.‘ 

The purpose was to derive a composite index for measuring general 
physique from several variables describing certain elements of physical 
structure. Our desire was to determine, at least, which variables in 
that index are the most important or have the greatest influence. 

According to the principles of factor analysis, suppose a set of n 
variables is associated with each individual of a population. These 
variables might be scores made by persons in tests of speed and ability 
in solving certain types of problems, or they might be various physical 
properties, as in the present instance. These variables are usually 


* This investigation was supported (in part) by a research grant from the 
National Institutes of Health, U. S. Public Health Service. 

‘Since writing this section, the authors have received an abstract of doctoral 
dissertation T883, by Helen Heath, University of Chicago, on “ A factor analysis 
of women’s measurements taken for garment and pattern construction.” The 
dissertation and references cited therein used methods of factor analysis on 
extensive lists of physical measurements. As far as these references could be 
checked, their problem was the classification of body types, an objective somewhat 
related to but different from the one in the present instance. 
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correlated among themselves to a considerable extent. The question 
arises whether it would be possible to reduce this set of n variables to 
a more fundamental and basic set, fewer in number than n, which will 
automatically determine the values of the n variables. There are always 
n possible sets, called principal components. 

In psychology and educational testing, the first set or first principal 
component has been called a mental factor, such as general ability. It 
appeared reasonable to expect that by using factor analysis methods 
with anthropometric data that the first principal component would 
represent an index of general physique. If so, the first factor loadings 
(see a; below) would indicate the comparative importance of each 
anthropometric variable in that index or component. 

The philosophy behind the application of the method in this example 
may be summarized as follows: 


Let 
Sy = A121 + Ge22 + Ag%s + Ay2q + A525 
where 
S, =the value obtained as a score from the first principal com- 


ponent, and purported to be an index or score of physical 
structure, 


z, = an individual’s weight expressed as a standard deviate from 
the group mean weight, 


z, = an individual’s height expressed as a standard deviate from 
the group mean height, 


z; = an individual’s bi-iliac expressed as a standard deviate from 
the group mean bi-iliac, 


z, = an individual’s chest expressed as a standard deviate from 
the group mean chest, 


z; = an individual’s calf expressed as a standard deviate from 
the group mean calf, 


a, = first factor loading for weight, 


Ga om “ “ “ “ height, 
a4, “ “ “ “ bi-iliac, 
a— “ ms ™ “ chest, and 
as — “ “ “ “ ealf. 


As a result of expressing the n variables as standard deviates with 
mean zero and unit variance, the total variance of all possible principal 
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components can be written as n. The percentage of total variation 
explained by the first principal component is obviously 


100V (S,)/n = 1003a,?/n. 


The criteria that would be used in discriminant function analysis 
to estimate values of the a’s are not applicable in factor analysis. A 
different type of restriction was resorted to in this case, since no 
dependent variate was available. 

It was assumed that a homogeneous group of individuals would have 
a common type of physical build, expressed by an identical score, Sj. 
Based upon this assumption, estimates of the a’s were made so that the 
variation among these scores was a maximum. ‘This was the criterion 
used by Hotelling (13) and Horst (12) in evaluating the coefficients 
in the index. The same results would be obtained, however, by using 
the principle of Edgerton and Kolbe (8) of minimizing the variation 
in score for each individual. In other words, children with similar 
scores were members of the same group, and the a’s were chosen so that 
the scores were similar within a known homogeneous group.°® 

Insofar as the problem here was concerned, interest was centered 
on the relative magnitude of the a’s for the first factor. Our primary 
concern was with the contribution of each of the five variables in 
determining the physical score. 

The method of Hotelling, based on product-moment correlations of 
all possible combinations of the five variables, was used. The only 
change here was that the correlations in the matrix were partial corre- 
lation coefficients. This removed the variations caused by age in all 
of the zero-order correlations. This had no effect on the results or 
interpretation of the procedure. 

The method of analysis was applied to both schools, boys and girls, 
and two age groups. A listing of the zero-order correlation coefficients 
for the five variables plus age in months is given for the age group 
96-143 months in Table 8. The result of the calculations to estimate 
the a’s is given in Table 9. The relative position of the three largest 


® There is difficulty here in that we do not actually know how homogeneous 
our groups are with regard to physical structure and other characteristics. 
Further studies in school A have led us to believe that the group is not entirely 
homogeneous, although as a group the children are markedly dissimilar from 
those in school B. 
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values of the a’s in each group is indicated in parentheses below the 
value itself. 

It can be seen at a glance that a,, the factor loading for weight, 
was uniformly the largest coefficient in each of the eight indexes. That 
is, weight of an individual was the most important measurement to 
describe general physique. It can be seen from the last column in 


TABLE 8 


Correlations among physical measurements and age, 
using data for schools A and B, boys and girls, aged 96-143 months 








VARIABLE AGE WEIGHT HEIGHT BI-ILIAC CHEST CALF 
Agp sees 0.5671 0.7086 0.5188 0.5430 0.5247 
eye e 0.5139 0.7058 0.6285 0.4531 0.4375 
Weight a 0.7772 0.9096 0.8727 0.9547 
0.5655 ~~ ......... 0.7064 0.8726 0.8904 0.9389 
Height 0.6814 0.8359 ~—=«.......... 0.7702 0.7298 0.7030 
0.7583 5 ers 0.8035 0.5859 0.6053 
Bi-iliac 0.5747 0.7881 5 ee 0.7984 0.8293 
0.6234 0.9660 ne 0.8049 0.8040 
Chest 0.4724 0.7789 0.6361 0.6250 ._—«........ 0. 8432 
0.3644 0.8238 0.5635 eee =o wewade 0.8223 
Calf 0.4649 0.9081 0.6260 0.6351 0.6600 ........ 





Note: Upper figure in each pair is for boys. 
Lower figure in each pair is for girls. 
Figures above the diagonal represent school A. 
Figures below the diagonal represent school B. 


Table 9 that between 65 per cent and 80 per cent of the total variation 
in these five measurements was explained by the factor of general 
physical structure. Weight by itself, however, explained one-fourth 
of the variation found in the first principal component. That is, 


a,*/Xa;* = V4, 
on the average. 

Another striking point in Table 9 was that the measurement of 
height, especially for school A, was relatively unimportant. Instead, 
the calf and bi-iliac seemed most sensitive or descriptive in that order. 
Among the indexes created for school B, it can be seen that height and 
calf were the two heavily weighted measurements following that of 
weight. 

When the eight indexes were examined from the standpoint of sex 
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and age, there were two conclusions which seemed of interest. The 
first is that older girls, 96-143 months, appeared to be best described 
by weight, bi-iliac, and calf, in that order. The second was that for 
all boys the weight and calf were best, and the third depended on socio- 
economic class—for school A it was bi-iliac, and for school B it was 
height. 


TABLE 9 


First factor loadings for five physical measurements, 
using factor analysis .n selected groups of school children 














PERCENTAGE 
FIRST FACTOR LOADING OF TOTAL 
AGE VARIATION 
SEX (months) a, ay a3 as ds EXPLAINED 
BY FIRST 
Weight Height  Bi-iliac Chest Calf FACTOR 
School A 
72-107 0.9615 0.8571 0.9181 0.8642 0.8876 80.7 
(1) (2) (3) 
Boys 96-143 0.9683 0.7608 0.9092 0.8804 0.9164 79.2 
(1) (3) (2) 
72-107 0.9529 0.7794 0.7448 0.8199 0.8578 69.6 
(1) (3) (2) 
Girls | 96-143 0.9648 0.6823 0.9126 0.8839 0.9114 76.8 
(1) (2) (3) 
School B 
72-107 0.9723 0.9066 0.8441 0.7997 0.8703 77.5 
(1) (2) (3) 
Boys | 96-143 0.9613 0.8106 0.8036 0.7700 0.8303 70.2 
(1) (3) (2) 
72-107 0.9145 0.8362 0.8199 0.7607 0.6930 65.3 
(1) (2) (3) 
Girls | 96-143 0.9816 0.7555 0.9621 0.8381 0.9096 79.8 
(1) (2) (3) 





e. Summary and discussion 

Summing up, the results indicated that children 96-143 months 
(8 thru 11 years) of age from upper socio-economic classes were taller 
and weighed more for their age. They were not heavier for their age 
when consideration was made for the fact that they were taller. The 
boys in the upper socio-economic class appeared to have more muscle 
mass and subcutaneous fat, as judged by calf circumference, for their 
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age and height. This difference was not sufficiently great to cause 
significant weight changes. Younger girls, 72-107 months (6 thru 8 
years), in the upper socio-economic group appeared to have more muscle 
mass and subcutaneous fat of an amount sufficiently great to cause 
weight differences for their age and height over those in a different 
socio-economic class. As a physical discriminant between socio-economic 
groups,° the chest measurement offered little additional information 
which was not already provided by weight and height and possibly calf 
circumference. The bi-iliac measurement may have additional value in 
certain groups, such as girls from higher socio-economic classes and aged 
96-143 months (8 thru 11 years). 

In future work, it is hoped to make comparisons of growth with 
dietary patterns and stages of puberty. As Hogben, Waterhouse, and 
Hogben (11) have pointed out, “ Height and weight norms exhibited 
without regard to the level of sexual development may be deceptive as 
standards for growth comparisons.” 
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APPENDIX 


The pilot studies upon which the sampling design was based con- 
sisted essentially of two main types. In the first series, each of ten 
boys, aged 9-10 years, in a nearby orphanage was subjected to physical 
measurement by each of two observers. The children were first dressed 
and then undressed. Each measurement was also done in duplicate. 
These measurements were used in the study of the importance of clothes 
for group differences as well as estimating between-observer and within- 
observer variabilities. 

It was only at the orphanage that the children were measured without 
clothes. In a second series of experiments, five boy and five girls from 
the fourth grade of school B were submitted twice to each of two 
observers in random sequence for measurements of height, bi-iliac dia- 
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meter, transverse chest diameter and calf circumference. In the case 
of weight, a different observer made two independent weighings of each 
child. The figures obtained were studied by analysis of variance to 
determine estimates of the separate components of variance. It was 
postulated that the variation in body measurements would be a combina- 
tion of the components due to individual children, variability due to 
different technicians, and the repeatability of a particular observation 
by the same technician. Estimates of the components of variance made 
from these studies are presented in Table 10. It should be pointed out 


TABLE 10 


Estimation of components of variance in measuring school children 





CHARACTERISTIC eTWEEN SUBJECTS BETWEEN OBSERVERS WITHIN OBSERVERS 
AND UNIT OF se 














MEASUREMENT Boys Girls Boys Girls Boys Girls 
Height (cms.) 59.84 63.51 0.00 0.04 0.50 0.02 
Bi-iliac (cms.) 1.58 1.88 0.05 0.00 0.06 0.02 
Chest (cms.) 1.03 0.52 0.01 0.28 0.04 0.05 
Calf (ems.) 4.40 7.35 0.01 0.02 0.01 0.01 
Weight (Ibs.) 108.80 323.60 ° . 0.00 0.00 





* This portion omitted from pilot study. 

Note: These values were based on small numbers of degrees of freedom, and their 
erratic nature indicates that they should be considered only as preliminary 
estimates. 


that these estimates are quite variable inasmuch as they are based on 
relatively few observations. No attempt was made to place confidence 
limits about these estimates. 
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AN APPLICATION OF STATISTICAL METHODS TO 
THE EXAMINATION OF SPUTUM FOR 
NEOPLASTIC CELLS * 


BY EUGENE K. HARRIS 


INTRODUCTION 


XAMINATION of various secretions for neoplastic cells has 
KH received considerable attention in the past fifteen years in view 
of the possibilities of diagnosis before metastasis has occurred. This 
is especially important in primary bronchogenic carcinoma in which 
only total pneumonectomy seems to offer hope of recovery. 

It has been recommended that routine sputum examinations be 
performed on all patients suspected of carcinoma of the lung. Thorough 
examination of sputum smears is a time-consuming task and cannot be 
routinely undertaken by a busy pathologist. Therefore, suitably trained 
non-medical technicians must be employed to screen smears, bringing 
all suspicious cell groups to the attention of the pathologist for final 
diagnosis. 

At present there exists no information concerning the diagnostic 
value of a single sputum specimen or smear when handled and examined 
by a trained technician. Moreover, the possibility of variation in the 
diagnoses of several technicians independently examining the same smears 
has not been explored. Coupled with this “ inter-individual ” variation 
is an “ intra-individual ” variation in diagnosis which may arise when 
one technician reads the same smear on two separate occasions. 

The present study has investigated these problems statistically in 
hopes of arriving at quantitative criteria which would enable the routine 
cytologic examination of sputum smears to be based on an efficient 
design and to yield accurate results. 


* This study was carried out in the Department of Public Health, with the 
cooperation of the Department of Pathology of Yale University, and was 
supported by a grant from the National Cancer Institute, U. S. Public Health 
Service. 
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The most comprehensive studies done to date on the subject of 
neoplastic cells in sputum have been those of Wandall (1944), Farber 
et al. (1948), Liebow et al. (1948), and Woolner and McDonald (1949). 

All of these workers used the wet-film technique introduced by 
Dudgeon and Wrigley (1935), followed either by haemotoxylin-eosin or 
Papanicolaou polychrome staining. The studies of Farber et al., and 
Woolner and McDonald are noteworthy because of the very large number 
of patients examined, the great majority believed to have non-neoplastic 
lesions. However, the reports of these workers indicate a lack of dis- 
crimination between results from patients known and unknown to the 
observer. Liebow has pointed out that the pathologist must have no 
knowledge of the clinical status or even the age of the patient in order 
to insure a completely unbiased cytological diagnosis. Only Wandall’s 
study seems to escape this criticism. 

Results of the four studies mentioned are summarized in Table 1. 


TABLE 1 


Review of previous results in the examination of sputum for neoplastic cells 








SPECIMENS SLIDES CORRECT FALSE 
PER PER PATIENTS PATIENTS POSITIVE POSITIVE 

OBSERVER PATIENT SPECIMEN WITHCA. WITHOUTCA. DIAGNOSES DIAGNOSES 

Wandall 3 5-6 100 93 84/100 6/93 
(84%) (6%) 

Farber et al. 3 3-5 66 400( 7) 54/66 ? 
(82%) 

Liebow etal. 2-3 ] 49 59 21/49 3/59 
(43% ) (5%) 

Woolnerand 1-2 5 180( 7?) 1400( 7) ? 

McDonald 80% * 





* Estimated by the authors. 


MATERIALS AND METHODS 


The wet-film method of preparing sputum smears was used through- 
out the present investigation. The patient was given a sterile Petri 
dish and carefully instructed to raise sputum from “ deep down in the 
chest ”—the trachea and bronchi. He was asked to rinse his mouth 
with water some minutes prior to coughing. This helped to remove 
food particles and superficial cellular detritus from the oral cavity. In 
the present study all collections were made in the morning between 
nine and eleven o’clock. 
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Dishes were gathered in about fifteen minutes and immediately taken 
to the laboratory. The sample of sputum was inspected against a black 
background to facilitate the separation of solid or blood-tinged portions 
from the surrounding mucous. With a wire loop likely bits of sputum 
were gently smeared in a thin film on a clean glass slide. It was 
necessary to flame the loop between transfers. 

Smears should not be made by rubbing sputum between two slides 
for there is then danger of disrupting cell groups and injuring individual 
cells. Blood-tinged portions of the specimen should always be selected 
since these are most apt to harbor neoplastic cells. If the sputum is 
not blood-streaked, the slides should represent random selections from 
solid material in different regions of the dish. In this study three 
slides were made from each specimen. 

The slides were immersed while wet in a solution of equal parts 
of ether and absolute alcohol. This fixative was first suggested by 
Papanicolaou (1933) for use in preparing vaginal smears. Slides should 
be kept in the fixative for at least one-half hour; practice in this 
laboratory has shown that they may be left as long as a week without 
harmful effects. 

The staining procedure represents a slight modification of that 
originally introduced by Papanicolaou (1942) : 


1. Pass slides through successive changes of 95 per cent, 80 per cent, and 
50 per cent alcohol (two minutes in each change). 
Stain for 5} minutes in filtered aqueous solution of Harris haemotoxylin. 
Rinse for one minute in distilled water. 
“ Blue” for one minute in 0.1 per cent ammonia water. 
Rinse again for one minute in distilled water. 
Dehydrate in successive changes of 50 per cent and 80 per cent alcohol 
(one minute in each change). 

7. Stain for 14 minutes in unfiltered Orange G solution, consisting of: 


ye 


ad 


a See rer er eT reer 1.00 gms. 
Phosphotungstic acid............. 0.05 gms. 
95 per cent alcohol................ 200 ce. 


8. Rinse for one minute in 95 per cent alcohol. 
9. Stain for two minutes in unfiltered EA-50 solution consisting of: 


DN 6 bd o088 den sde vei scucs 1.00 gms. 
WiMhGTOK BROWER... .. 2... sccccvens 0.16 gms. 
>, Ee ere ee ee ere 0.84 gms. 
Phosphotungstic acid............. 0.68 gms. 
95 per cent alcohol............... 380 ce. 
i PP eee rere 20 ee, 
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10. Dehydrate in two changes of 95 per cent alcohol and two changes of 
absolute alcohol (one minute in each change). 


11. Clear in two changes of xylol (five to fifteen minutes in xylol is suitable). 
12. Mount in Permount. A 20- or 22-mm. x 40 mm. cover slip may be used. 


After staining, the slides were submitted to systematic microscopic 
study with the help of a mechanical stage. In this way every cell field 
under the cover slip was scrutinized. Low power magnification (X< 120) 
sufficed for the bulk of the observation, but all suspicious fields were 
subjected to higher power (X 440) before an opinion was reached. All 
atypical cells or cell groups which appeared as if they might come from 
a carcinoma were shown to the consulting pathologist for confirmation. 

In this study each slide was identified by number at the time the 
smear was made. However, examination took place from one to several 
days after the recording of the number against the name of the patient. 
By this time the observer had lost all remembrance of number-patient 
correspondence. Moreover, slides were examined in batches of twenty, 
each batch including specimens drawn from several patients. As an 
added precaution, the numbers were masked with gummed labels and 
the slides rearranged in random order just before microscopy. Thus, 
the writer feels justified in claiming that, at the time of examination, 
he had not the slightest information about the source of any of the 
slides, nor could the opinion reached on one slide possibly influence his 
decision on another slide from the same specimen. 

It has been stated that one of the objects of this investigation was 
to determine inter-individual and intra-individual variation among tech- 
nicians trained to read sputum smears routinely. To this end, three 
such technicians agreed to cooperate with the writer in forming a group 
of four non-medical observers with special skill in sputum cytology but 
otherwise no formal experience in pathology. 

One of these technicians had had sufficient experience with sputum 
smears in the pathology laboratory of the Hartford Hospital, Hartford, 
Connecticut, to allow admission to the study without further examination. 
A second technician had less experience with the technique but after 
some preliminary training was able to report on a test set of slides with 
sufficient accuracy to insure reliable data in the study proper. 

One other technician, however, was completely new to the technique 
and passed through a training period of about three months. During 
this time she received intensive instruction and experience in the 
collection, preparation, and examination of sputum specimens and was 
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acquainted with the various clinical and surgical aspects of cancer of 
the lung. At the end of three months, following a satisfactory per- 
formance on a final test set of slides, she was admitted to the study. 

Each of the three cooperating technicians has independently read 
and reported on one hundred slides selected for distribution. Moreover, 
two technicians examined each slide twice, the second examination 
separated from the first by an interval of approximately two months, 
These technicians were unaware of the fact that they were making repeat 
examinations of each slide but rather were under the impression that 
they were being asked to report on two hundred different slides. Slides 
were distributed in random order in batches of twenty and their identities 
masked by code numbers so that the reading on one slide could in no 
case affect a subsequent examination of a slide from the same specimen, 

The writer, on previous examination, had noted positive fields, con- 
firmed by the pathologist, on thirty-four of the one hundred slides 
selected. The remaining sixty-six were thought to be negative. How- 
ever, since the writer felt that, acting as liaison between the other 
technicians, he had become too familiar with this set of slides, duplicate 
readings were made with a separate group of one hundred slides. In 
this case a period of some six months had elapsed between readings. 
The usual precautions of randomization and coding were taken. Any 
field which a technician thought positive or very suspicious was brought 
to the attention of the pathologist for confirmation. If the pathologist 
considered the field non-neoplastic, the slide as read by that technician 
was designated negative. 

This circumstance leads to an important conclusion which should 
be emphasized. It is impossible for a technician to be charged with a 
false positive slide since final judgment on any suspicious field must 
come from the pathologist. Therefore, he alone is responsible for any 
false positive reported. On the other hand, the occurrence of a false 
negative slide (i.e., a slide containing neoplastic cells which has been 
incorrectly diagnosed negative) will usually be the fault of a technician 
who, in examining the slide, has missed a positive field present on it. 
Such a false negative is revealed only when another technician finds a 
positive field on the slide which has been called negative, or when the 
technician originally at fault discovers a positive field upon a subsequent 
re-examination of the same slide. Of course, on occasion a false negative 


will occur when a pathologist incorrectly reverses the decision of a 
technician who has brought to his attention a field actually positive. 
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In short, the function of the technical aide in sputum cytology work 
is to screen the great mass of negative smears and present to the 
pathologist only those cell groups which appear suspicious or frankly 
positive. Naturally, neither person is infallible. These considerations 
will be enlarged upon later in this paper. 

The cytology of sputum has been described most exhaustively by 
Wandall although every recent report on the examination of sputum 
in the diagnosis of lung tumor has contained a review of the various 
types of cells encountered. Since any realistic description of these cells 
must of necessity be lengthy and detailed, the subject has been omitted 
entirely in an effort to focus attention on the statistical methods 
employed in evaluating sputum diagnosis. A fairly complete discussion 
of the cellular content of sputum, aided by microphotographs, has been 
included elsewhere (Harris, 1950). 


‘ 


THE DIAGNOSTIC VALUE OF SPUTUM EXAMINATION 


The present study includes material drawn during the course of 
about eighteen months from 152 patients who were hospitalized on the 
chest service of the Grace-New Haven Community Hospital. The 
presence of carcinoma of the lung has been confirmed in 63 of this 
group by microscopic examination of tissue received at bronchoscopic 
biopsy, operation, or autopsy. In five other patients no tissue diagnosis 
of cancer was available, but clinical and roentgenographic examinations 
leaned so heavily in that direction that these patients have been included 
in the group positive for cancer. 

Six patients were proven by tissue diagnosis to be suffering from 
cancer of the esophagus. Since these patients yielded sputum which 
was in every case negligible in amount and unfit for examination, they 
have been omitted from the study. 

Metastatic carcinoma of the lung has been proven in five patients. 
The primary sites were, respectively, the pharynx, prostate, rectosigmoid, 
cervix, and liver. In one case neoplastic cells were found in the sputum. 
However, it cannot be assumed that sputum diagnosis failed in the other 
four patients since the metastatic growth may not have communicated 
with a bronchus. In fairness, therefore, all five patients were omitted 
from the group considered positive for bronchogenic carcinoma. 

In forty cases the presence of chest pathology other than carcinoma 
was confirmed by examination of specimens removed at operation. In 
27 other cases where no microscopic proof was available the previous 
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history of the patient, the clinical and roentgenographic findings, and 
the subsequent course of the disease strongly indicated that the patient 
was suffering from non-neoplastic disease. 

Finally, there remains a small nucleus of 6 patients about whom no 
final conclusion can be reached. In three cases bronchoscopy revealed 
masses in the main or intermediate bronchi, but biopsies in two of these 
patients were reported showing only chronic inflammatory reaction and 
in the third patient merely bronchial mucosa. In all 6 cases a clinical 
or roentgenographic examination which admitted the possibility of tumor 
diminished in importance in view of a favorable hospital course and 
follow-up in the clinic. These individuals have been under observation 
for lengths of time varying from twelve to twenty months. In no case 
has a death certificate been received. 

An effort was made to obtain three sputum specimens from each 
patient included in the study. In some cases this was impossible because 
of early discharge or because the patient underwent surgery almost imme- 
diately after admission. Three specimens were obtained from 63 of the 
68 patients positive for cancer of the lung. Three patients yielded two 
specimens while only single specimens were obtained from the remaining 
two patients. The total number of sputum specimens obtained from 
these patients was 197. 

It would be unwise to estimate statistically the results of missing 
specimens in the two cases in which only one specimen was obtained. 
This would in fact be “ filling in” two-thirds of the data. However, 
for some of the analyses to follow in this chapter, it would be helpful 
to estimate the result of the missing third specimen in those cases in 
which no more than two specimens were obtained. 

In order to estimate the probable result of a missing third specimen, 
the patient in question may be compared with other patients with cancer 
who yielded three specimens. In each of the three patients under con- 
sideration the two specimens obtained were both positive. Investigation 
revealed thirteen patients yielding three sputa of which the first two were 
likewise positive. The third specimen was reported positive in nine of 
these cases, negative in four. Applying a similar ratio to the three 
patients with missing specimens, it will be assumed that a third specimen 
would have been positive in two of these cases and negative in one. No 
attempt will be made to estimate the results of particular slides from 
these missing specimens. 


The data which now follow concern the number of positive specimens 
} | 
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and slides obtained from cases of proven cancer. These figures should 
be understood as the results of microscopic examinations by a well- 
trained technician subject to the human variations analyzed later in 
this paper. 

Table 2 shows the number of patients with cancer yielding one, two, 
or three positive sputum specimens. These data include the estimated 
results of missing third specimens. However, the two patients who 
yielded only single specimens are not included. 


TABLE 2 


Numbers of patients with carcinoma yielding 0, 1, 2, or 3 positive 
sputum specimens 








NUMBER OF NUMBER OF PATIENTS 
POSITIVE 
SPECIMENS Observed Calculated 
0 31 18.7 
1 13 29.3 
2 11 15.3 
3 11 2.7 
Total 66 66.0 





Thirty-one patients did not offer any positive specimens. Therefore, 
53 per cent of a total of 66 patients with carcinoma of the lung yielded 
at least one positive sputum specimen. 

False positive smears were obtained in four patients. In two of 
these cases, only one of the nine smears examined was incorrectly 
diagnosed. The percentage of patients without neoplastic disease of 
the lung who were incorrectly reported positive is 4/67 X 100, or 6.0 
per cent. 

Before attacking some of the larger problems posed by the data 
thus far presented, two questions may be resolved by routine Chi-square 
examinations. We wish to determine whether any one of three morning 
specimens obtained from a patient with cancer of the lung is especially 
apt to harbor neoplastic cells. Table 3 gives the number of positive 
first, second, and third specimens obtained in this study. It is assumed 
of course, that all specimens were collected, processed, and examined 
with equal care. 
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TABLE 3 
Numbers of positive first, second, and third specimens obtained } 
NO. OF POSITIVE NO. OF POSITIVE NO. OF POSITIVE 
1ST SPECIMENS 2ND SPECIMENS 3RD SPECIMENS 
22 25 21 
Total: 68 





If, indeed, no one specimen is especially apt to contain neoplastic  / 
cells, then each category of Table 3 should include one-third of the 
total, or 22.67. The Chi-square (x*) test judges the significance of the 
discrepancy between the observed and expected situations: 


y= es __ (0.67)? , (2.33)* | (1.67)? 
E 





22.67 22.67 22.67 } 


= 0.38, with two degrees of freedom. 


The probability of the chance occurrence of such a value is .83. It 
is clear that all specimens are equally capable of containing neoplastic 
cells. 

Exactly the same question may be asked of the three slides made 
from a single specimen. Table 4 exhibits the requisite data for first, 
second, and third slides. The same assumptions held here as in the 
analysis of specimens. 

TABLE 4 


Numbers of positive first, second, and third slides obtained 








NO. OF POSITIVE NO. OF POSITIVE NO. OF POSITIVE 
IST SLIDES 2ND SLIDES 3RD SLIDES 
40 42 46 


Total: 128 





The expected number in each category of Table 4 is 42.67. Then, 


» (2.67)? , (0.67)? 
xe 42.67 42.67 


(3.33)? } 
42.67 0.44. 





ot 


The probability attached to this value of x? is .80. Clearly no one 
slide is more likely than another to contain tumor cells. 

At this point the data of Table 2 may be examined to estimate the 
probability that a single specimen from a cancer patient will reveal 
neoplastic cells. We have a total of 198 specimens, of which 68 
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(= 13.1-+ 11.2 +4 11.3) are positive. The chance that a specimen 
taken at random is positive may thus be estimated as 


P = 68/198 = .343. 


It is, however, to be noted that this estimate has tacitly assumed 
that the probability of a positive specimen is the same for all patients. 
This can be tested by comparing the observed distribution of Table 2 
with that calculated from the binomial 

(.343 + .657)*. 

The last column of Table 2 shows this comparison. It is at once 
evident that the observed distribution differs strongly from the calcu- 
lated, in that there are too many observed cases of no positive specimens, 
and of three positive specimens. This suggests strongly that (as one 
might reasonably expect) there are considerable differences between 
patients in their chances of producing specimens containing neoplastic 
cells. 

The next problem is to estimate the probability that any one of 
three smears prepared from a single positive specimen will contain neo- 
plastic cells. Table 5 gives the frequency of specimens with one, two, 
or three positive smears. 


TABLE 5 


Numbers of specimens with one, two, or three positive smears 





NUMBER OF POSITIVE SMEARS 
| 2 3 Total 
Number of specimens 25 26 17 68 











The method of maximum likelihood will be employed. However, 
the mathemtical model will be modified somewhat since the discovery 
of a positive sputum specimen requires that at least one positive smear 
be reported. Therefore, in the case of a known positive specimen the 
probability of at least one positive smear must be unity. It is very 
likely, however, that many specimens contain neoplastic cells but yield 
no positive smears. The adjusted model will take into account undis- 
covered positive specimens. 

Let a; (k = 1,- + +,n) represent the observed frequency of k positive 
smears out of n (here, n=3). The probability that the number z of 
positive smears equals a given number & is expressed by the binomial 
formula : 
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— — ne k(1 — p)n* 
Now, 
P[r= 0] = (1—p)", 
and 
P[z~0] =1—(1—p)". (2) 
Then, the probability that z = k, given that zr ~ 0 (i. e., the situation 
in this problem), may be expressed as 
C[p(1—p)"*] , 
sr 
The probability of obtaining the observed distribution of frequencies 
is given by the expression 


; oo! Red stall 
P(E] =¢. TI pe] ; (4) 


P[z=—k|r0)] = 





(3) 





Differentiating logarithmically with respect to the parameter p and 
setting the result equal to zero, we get, after some rearrangement, 
(1 — p)& ka, — p3(n—k)ay _ n(l_—p)"s 4% 
P 1—(1—p)* ° 
Substituting the values, a,, from Table 5, for n = 3, and transposing, 
one obtains 





(5) 


128(1—p)— 6p —_ 204(1— p)* __ 
Pp 1—(1—p)® 





(6) 


Estimates of p may be obtained to any desired precision by a process 
of iteration. Trial values of p are substituted into (6) and the left- 
hand side calculated. The resulting value determines the next estimate 
of p to be interpolated. The details are given in Table 6. 


TABLE 6 


The iteration solution of equation 6 

















p m9) 6 58 582 5815 
128 (1 — p)—76p 52.00 9.33 16.69 15.931 16.1204 
p 
204(1 — p)* 29.14 13.95 16.32 16.072 16.1360 
1—(1—p)* 
Difference 22.86 — 4.62 0.37 —0.141 — 0.0156 





1 This formula is a simplified expression of the well-known “ Bayes’ formula 
for probabilities of hypotheses,” a discussion of which may be found in Uspensky’s 
textbook on mathematical probability (1937). 
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Correct to three decimal places, 
p = .581. 


This is an estimate of the probability that neoplastic cells will be 
transferred from a specimen to any one smear made from a small 
portion of that specimen. 


SOME PREVIOUS STUDIES OF HUMAN VARIATION 


The experimental methods employed in the present study of human 
variation were derived in large part from two recent papers, one by 
Yerushalmy (1947), the other by Birkelo, Yerushalmy, et al. (1947).? 

Yerushalmy and his associates discussed subjective variations in 
medical diagnosis with special reference to roentgenographic techniques 
used in tuberculosis case-finding surveys. The material for the study 
was drawn from a group of 1,256 individuals who were X-rayed con- 
secutively on four different X-ray techniques, yielding for each person 
a 35 mm. photofluorogram of the chest, a 4” & 10” stereophotofluorogram, 
a roentgenogram on 14” X 17” paper negative, and a conventional 
14” & 17” celluloid film. 

These four sets of chest films of different sizes were distributed 
among five expert radiologists who independently diagnosed each film. 
Analysis of the results afforded a measure of “ inter-individual ” error, or 
“the inconsistency of interpretation found among different readers.” 
In a typical case, one reader called 59 films positive for tuberculosis. 
Of these, the other observers read, respectively, 31 per cent, 29 per cent, 
37 per cent, and 14 per cent negative for tuberculosis, although in many 
of these cases the presence of some non-tuberculous lesion was indicated 
(e. g., pulmonary scar, cancer, etc.). In an average of 12 per cent of 
the films the diagnoses varied from positive for tuberculosis to com- 
pletely negative. 

Following an interval of two to three months, the 14” < 17” celluloid 
films were re-circulated among the radiologists and results obtained 
which measure the “ intra-individual ” error, or the “ failure of a reader 
to be consistent with himself in two independent interpretations of the 
same set of films.” In the case of a radiologist who struck about the 
average in consistency, the writers showed that of 83 films which this 


*For a review of earlier studies of subjective variations in diagnosis, see 
Harris, 1950, Ch. VI. 
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radiologist called positive for tuberculosis on the first reading, 18 per 
cent were diagnosed negative for tuberculosis on the second reading, 
two-thirds of these completely negative. On the other hand, this 
observer diagnosed 88 films positive for tuberculosis on the second 
reading and of these 29 per cent had been called negative for tuberculosis 
on the first reading. 

After discussing several methods of analysis, Yerushalmy and his 
associates concluded that no one technique was more efficient than 
another in detecting cases of tuberculosis. They recommended that in 
mass survey work all films be read independently by at least two 
interpreters. 


THE ACCURACY OF CYTOLOGIC TECHNICIANS 


The present analysis was made to estimate the probability that a 
non-medical technician well-trained in cytologic diagnosis will detect 
neoplastic cells contained in a sputum smear. As mentioned previously, 
four such persons independently examined a group of one hundred 
sputum smears. Each smear suspected of containing neoplastic cells 
was brought to the consulting pathologist for confirmation. 

Table 7 reflects the relatively large variation exhibited by the four 
technicians, A, B, C, and D. The second column of this table gives the 


TABLE 7 


Variation shown by technicians in diagnosing one hundred sputum smears 








NUMBER OF PERCENTAGE READ POSITIVE BY 

TECHNICIAN POSITIVE SMEARS A B C D 
A 34 62 53 29 
B 32 66 69 22 
Cc 33 55 67 30 
D 16 62 44 62 





number of confirmed positive smears detected by each observer on the 
first reading. The remainder of the table lists the percentage of those 
smears called positive by each of the other independent observers. Thus, 
for example, Technician B diagnosed 32 slides positive. Of these, Tech- 
nician A called 66 per cent positive, Technician C called 69 per cent 
positive, and Technician D called 22 per cent positive. 

The mathematical model used to analyze differences between tech- 
nicians in the interpretation of sputum smears was introduced by 
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Neyman (1947) in a treatment of the roentgenographic data presented 
by Yerushalmy and his associates. 

Consider a population of sputum smears divided into two exclusive 
categories: (1) smears showing no evidence of neoplastic growth, whose 
proportion is «; (2) smears positive for neoplastic cells, whose pro- 
portion is B= 1—a. 

The parameters + and p are postulated to be the probabilities of 
correct diagnosis for categories (1) and (2), respectively. Hence, 1—~- 
represents the probability of a non-neoplastic smear being diagnosed as 
containing neoplastic cells (i.e., the probability of a false positive). 
Similarly, 1 — p denotes the probability of missing a smear containing 
neoplastic cells (i. e., the probability of a false negative). 

Applying the notation used by Neyman, let N represent the number 
of sputum smears included in the sample and n the number of inde- 
pendent diagnoses made for each smear. Denote by k& the number of 
positive diagnoses made for each smear. Then k may assume any one 
of the n+ 1 values, 0, 1,2,---,n. Let Ny stand for the number of 
smears in the sample for which there were exactly & positive diagnoses. 
The relative frequencies, q,, of exactly k positive diagnoses are obtained 
by dividing V;/N. There are n+ 1 such frequencies, but only n of 
them are independent since they must add to unity. 

Table 8 gives the observed frequencies of smears with k positive 
diagnoses. 








TABLE 8 
Absolute and relative observed frequencies of sputum smears 
with k (0,1,. -.,4) positive diagnoses 
k ABSOLUTE FREQUENCY RELATIVE FREQUENCY 
0 46 46 
1 17 17 
2 16 16 
3 18 18 
4 3 .03 
Total 100 1.00 





In the construction of a mathematical model, three independent 
parameters, a, 7, and p, have been postulated. Moreover, the goodness 
with which the model fits the observed frequencies must be tested. 
Therefore, at least four independent diagnoses are required for each 
smear. The results obtained from the four technicians cooperating in 
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this study satisfy this requirement assuming that all technicians are 
equally competent to report on sputum smears. 

Then, the probability that n diagnoses will give exactly k positive 
results is given by 


F 


and 


! 
b= Fp fal — ate + (1 — 2) phd — ps, for ke <n, 


P, = 1—a{1—(1—x)"] —(1—2) [1—p*]. (7) 


Using these n+ 1 equations, the three unknown parameters ‘may 
be estimated according to the method of best asymptotically normal 
estimates described by Neyman (1945-46). 

By eliminating the three unknown parameters from the system of 
four independent equations (7) an equation may be obtained of the 
form 

F, (Px) = 9, (8) 


which serves as a final restriction on the parameters. 
For n = 4, this equation must be 


72PoP2P, + 9P,P2P3 — 2P.3 — 27P,P 3? — 27P,7P, =0.* (9) 


If the right-hand sides of the system of equations (7) are substituted 
for the probabilities, P;, in (9), it may be shown that (9) represents 
an identity in the parameters a, 7, and p. 

Using (9), one immediately obtains 


F'e(qi) = 72qoG2qs + 9919293 — 2G2° — 27qogs* — 27q:7q, (10) 


in terms of the observed relative frequencies given in Table 8. 
The discrepancy between the “observed ” equation, /;(q;), and the 
“expected ” model, F;(P;), may be measured by expanding the latter 


in a Taylor series about the point P; —q;. Thus, 


F, (Px) = Fi(qi) + ¥ bea(Pr—a) + R, 


The remainder R tends in probability to zero as Nx. The 
coefficients, b;;, represent the first order partial derivatives of F;(P,) 
with respect to the expected frequencies, Py. Neyman (1945-46) has 


’ The writer is indebted to Dr. Werner Leimbacher of the Statistical Labora- 
tory of the University of California for the mathematical derivation of the 
equation (9). This derivation is reproduced in the Appendix. 
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shown, however, that these partial derivatives have the same values as 
the first order partial derivatives, a;;, obtainable from the expression 
F;(qi) at the point Py, = qi. 

The terms, a:,:(P.— qi), may be obtained by differentiating the 
expression /’;(qi) with respect to the observed frequencies, qj. In 
keeping with the hypothesis P;, = qi, one may assume that 


a= > Git = 0. 
ia 


Then, the expression 


has a Chi-square (x*) distribution with one degree of freedom. This 
distribution is used to test the hypothesis that the system of equations 
(7) with three unknown parameters adequately explains the observed 
frequencies. 
Inserting the numerical values from Table 8, 
F, (qi) = 0.158976 +- 0.044064 — 0.008192 — 0.402408 — 0.023409 


= — (1.230969. 
The partial derivatives, aj, (i —=0,1,---,4), are then computed: 
dy = 724294 — 27qs? = — 0.5292, 
a, = 99293 — 549144 = — 0.0162, 
As = 720s + 9919s — 6q2” = 1.1154, 
rE = 94:92 amas 5404s —=- 4.2264, 


a= 7202 —— 27q," = 4.5189. 
Hence, 


—] 


= & qidi = — 0.449475, 
and 
2 == & gia,” — a? = 3.824931. 


P(g) pee 053347 
yw [EE | — 100 | ora | — 1.395, 


Then, 


with one degree of freedom. 

The probability of the chance occurrence of a x* value greater than 
this is .24. Evidently the proposed model (7) adequately represents 
the mechanism underlying the observed frequencies. 
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In order to estimate the parameters a, 7, and p, approximations P, 
of the P, in equations (7) are computed: j 


Pyeng tt — “= -Fi(q)], t= 0,1,-°-,4. (12) 





Substituting numerical values: 


P, = .46(1 — 0.004814) — .457786, 

P, = .17(1 + 0.026163) — 174448, 

P, = .16(1 + 0.094495) — .175119, 

P, = .18(1 — 0.228070) — .138947, 

P, = .03(1 + 0.300016) = .039000. 
_ Incontrast to the observed frequencies, qi, these estimated frequencies, | 
P;, are independent and therefore do not have to add to unity. 


The case n = 4, requires computing the coefficients: 
2(6P.P; — P,P) 
8P.P2 —7 3P;? 


9P,P, cosine 4P,” 
~ 3(8PoP2 — 3P;°) 


= 1.276631, 








B = ().057865, 


and solving the quadratic 
r?*—Ar+B=0, 


with z, the smaller solution and x, the larger solution. 
These solutions are 
2, = 0.047061, 


Lo = 1.229570. 





Then, 
a 1 RANA 5 
ts ~~ = .955054 (= .955), 
and 
a Z2 
T 72 


The parameter a is obtained from the formula 


4P.r. — P; 
4(r. — 2) 


a= 


(1+ 2,)4—=.527807  (—.528). 


The estimate obtained for ~ indicates that a proportion of 4.5 per 
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cent false positive diagnoses may be expected when suspicious smears 
are brought to a consulting pathologist for confirmation. This figure 
agrees well with percentages obtained in previous investigations in which 
the pathologist diagnosed smears without preliminary screening by 
technicians. As mentioned previously, the pathologist alone is respon- 
sible for false positive diagnoses. 
The estimate 
p= .551 


means that if any one of the four technicians in this study were given 
one hundred sputum smears each showing some evidence of neoplasia. 
about 55 positive diagnoses might be expected. 


STANDARD ERRORS OF THE PARAMETERS 


Let 6,(q), a function of qo,q:,° °°, 4s, be a BAN estimate of 6,. 
Then, Taylor’s formula gives 


e 4 
6, = 6, + 2 ae (qe — Pr) + R, 


where 


and where the remainder FR tends in probability to zero as No. 
Then, the asymptotic variance of 6, is given by, say, 


2 1 . 
= — 2 
oe Hy DP 


However, the computation of the partial derivatives, a,, is often 
quite laborious, and Neyman has indicated a much more convenient 
method of obtaining the variance. 

As illustrated by equations (7), P; is a function of three independent 
parameters 6, 42, 4s, 

P, —= f(%, 62, 6;). 

Let 


and 





Sf *#e @e @ £8 see 5 
Et — : 











198 EUGENE K. HARRIS 


Let 

12 Gis 
2 Gos 
G31 Gg. T 33 


and denote by Az, the co-factor of A corresponding to G;,m. 
Then, Neyman has shown that 


3 
D> fetAr1 
= ay = ~ .. > 


fd 


00; 
09x | Qu = Px 
and the asymptotic variance of @, is given by 
© D> » 


° Ais 
Ce = NA . 
Let 6, =f, 6. = p, 0; = a. 
Then, calculating the partial derivatives of P, with respect to z, 
p, and a, and substituting =, p = p, =a and P, = f;, one obtains 


A =—=120.1825, 
Ai,1—= 13.5681, 
Aco == 48.7523, 


As3.3 = 107.1 123, 


and, for N = 100, os = + .034, of = + .064, of = + .094. 


INTRA-INDIVIDUAL VARIATION 


As described above, three of the four observers, including the writer, 
re-examined selected sets of one hundred smears which had been read 
for the first time about three months previous. In a study of variation 
“ within ’ 
is especially interested in reversals of diagnosis from positive to negative 


’ observers, or the consistency of an observer’s diagnosis, one 
and vice versa. Table 9 shows the frequency with which these reversals 
occurred. 

Table 9 indicates that two observers diagnosed more slides positive 


on the second reading than on the first reading, while in the case of 
one observer the reverse situation obtained. Therefore, it cannot be 
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maintained that during the course of the study the observers showed 
significant improvement in ability to detect smears containing neoplastic 


cells. 
TABLE 9 


Reversals of diagnoses as a measure of intra-individual variation 








OBSERVER NO. OF REVERSED DIAGNOSES TOTAL PERCENTAGE 
REVERSAL 
Pos. > Neg. Neg. Pos. (100 smears ) 
A 5 9 14 14% 
B 8 19 27 27% 
C 12 9 21 21% 





Based on a small number of observers, a mean percentage reversal 
of 21 per cent is indicated with a standard deviation of + 6.5 per cent. 
Considering the estimated value, p, the mean percentage reversal is not 
surprising. It is natural that on each successive reading a technician 
will discover positive smears missed at previous examinations and, at 
the same time, fail to report smears previously called positive. This 
measure of intra-individual variation in no way invalidates the estimates 
of the parameters a, x, and p. 


DISCUSSION AND RECOMMENDATIONS 


The aim of this section is to recommend some applications of the 
statistical results to the routine examination of sputum for neoplastic 
cells. Material for these examinations will be drawn from two classes 
of subjects, (1) hospitalized patients suspected of some chest pathology, 
possibly carcinoma, and (2) ambulatory persons appearing at a clinic 
for reasons perhaps unrelated to bronchogenic carcinoma but having a 
productive cough which gains the attention of the physician. 

Sputum examined in the present study was collected entirely from 
hospitalized patients. In all the cases in which neoplastic cells were 
found in the sputum, clinical and roentgenographic findings had already 
supported a suspicion of carcinoma. In only 17 per cent of these cases 
was the lesion operable. 

The greatest value of sputum diagnosis lies in the microscopic proof 
of early cancer. Therefore, for the most gratifying results, sputum 
examinations for neoplastic cells should be performed on every clinic 
patient with a productive cough. The results of the present study cannot 
be applied indiscriminately to ambulatory persons, and further research 
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is necessary before sputum diagnosis may be evaluated with regard to 
this group. Certainly in such a program the ratio of negative to positive 
smears would be overwhelming and the time and efforts of cytologic 
technicians would not be spent economically. 

A valuable undertaking would be an investigation into the accuracy 
of pathologists in judging sputum smears. A consulting pathologist 
must bear the blame for some of the false negative as well as all of the 
false positive diagnoses. The present study may serve as a model for 
an assessment of sputum diagnoses made by pathologists. 

An analysis given in the previous section indicated that one tech- 
nician will fail to detect positive smears about 45 per cent of the time. 
Further calculation shows that 69 per cent of the positive smears will 
be correctly reported when two technicians independently read the same 
slide. Sensitivity (i.e., the probability of detecting positive smears) 
rises to 86 per cent when three technicians report independently on one 
slide and to 94 per cent when four independent diagnoses are made. 

Since the examination of a sputum smear is a very time-consuming 
task, no more than two technicians can be expected to examine each 
smear. The increase in sensitivity obtained when two technicians work 
independently suffices to raise the probabilities calculated earlier in this 
paper. The probability that one of three specimens from a patient with 
carcinoma will be positive is raised from .343 to .422. Similarly, the 
probability that one of three smears from a positive specimen will be 
positive is increased from .581 to .602. 

In view of these adjusted probabilities, applicable to hospitalized 
patients, it seems advantageous to collect at least four specimens from 
each patient and prepare two slides from each specimen. The time 
spent in collecting specimens is almost negligible. On the other hand, 
twenty to thirty minutes are required to examine systematically each 
slide. Therefore, when sputum diagnosis is contemplated as a routine 
procedure, the number of slides to be examined assumes great importance. 
Examination of two slides from each of four specimens will require 
less time than three slides from each of three specimens and will probably 
uncover more cases of bronchogenic carcinoma. 

It is further recommended that two technicians independently report 
on each smear prepared. In order to assume a volume of work sufficient 
for a group of technicians, the cytologic laboratory should be situated 
in a large hospital which can draw material from private physicians and 
smaller institutions within a wide radius. Lastly, a pathologist with 
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special training in sputum cytology must be present to render final 
diagnosis. 
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TECHNICAL APPENDIX 
Derivation of equation (9): F;( Px) =0. 


Assuming equations (7), for n = 4, introduce the following changes 
of variables: 





bem ay Po (i—0,1,-- +, 4), (A. 1) 
ant = w (1—2)(1—p)* =», 
sie : (A. 2) 
~aas 7" 
Then, 
{i,—= u+v 
A, = su+ tv 
A, =s*u + tv (A. 3) 
A; = s*u + fp 
A, = stu + fv. 


The first three linear equations in two unknowns, u and v, can hold 
simultaneously only if the determinant 


Ao 1 1 
A 8 = 6t [ (A. 4) 
Az s* t? 


for unless this condition holds in a system of (m+ 1) equations in n 
unknowns, the equations are inconsistent. 
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Likewise, the following conditions hold: 


A, $ t A, 1 1 
Az s? t? | mest] Ao 8 t | =0, (A. 5) 
As # - Az s* a 
A; s* F Az 1 1 
A; s* i | = s*l?| A; s t | =—0. (A. 6) 
Ag g* i* A, s* §* 





These determinants are expanded to obtain the equations 


A,— 2A, + yA, = 0 (A. 4’) 
A,— 2rA,+ yA; = 0 (A. 5’) 
A,—aZa; + yA,=0 (A. 6’) 


where r=—s+t, y=st. 
Again, to be consistent, these three equations in two unknowns, 
z and y, can hold simultaneously only if 


| A, A, Ay 
As A, A, —_ 0. (A. 7) 


Expanding (A. 7), 
A,} —— 2A,A.,A; — A,*A, a A,A;? a AvAA, —_ 0. (A. 8) 
Substituting in (A.8) according to equations (A. 1), equation (9) 
is obtained. 


Equations used in the estimation of the parameters, # and p, are 
obtained as follows: 


Rewriting equations (A. 4’) and (A. 5’) in terms of s and f, 
A, — A,s — A,t + Ayst — 0, 
A; — A.s — Apt + A,st = 0. 
Considering only one variable, ¢ for example, these two linear 
equations will hold simultaneously only if 
| (A,—A,s) (Aes — Ai) | 


| == Q. A. 
| (A; — A2s) (A,s — A.) | - " 
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Expanding (A. 9): 
s*(AyAy — A,*) —8(Aods — ArAz) + (414s — 4g") = 0. (A. 10) 


‘ The same result will hold in terms of ¢ if (A. 4’) and (A. 5’) are 
considered with respect to the single variable s. 

Dividing equation (A.10) by (A,A,— A,*) and substituting for 
the A’s according to equations (A.1), one obtains the coefficients, 
A.and B, given in the text. 

The formula for the parameter 2 is derived from the first two 


equations of the system of equations (A.3). Substitute for u, v, s, ¢ 
according to equations (A. 2) and solve the resulting expression for a, 








